December 1956 


SF IOMETRICS 


Vol. 12 No. 4 
JOURNAL OF THE BIOMETRIC SOCIETY 


The Statistical Analysis of a Complex Experiment 
Involving Unintentional Constraints 


Simplified Analysis of Sing'y Linked Blocks 
A Rankit Analysis of Paired Comparisons for Measuring 


the Effect of Sprays on Flavor C. I. Bliss, M. L. Greenwood 
and E. S. White 
Theoretical and Experimental Study of Self 
Fertilized Populations T. W. Horner 
and C. R. Weber 
The Discrimination of Interactions and Linkage 
in Continuous Variation Birger Opsahl 
Applications of the k, Statistic to Genetic 
Variance Component Analysis D. S. Robson 
Note on Wald’s Method of Fitting a Straight Line 
When Both Variables are Subject to Error E. S. Keepjng 
Recent Advances in Biometry in Japan M. Masuyama 
and M. Hatamura 
Control of Errors in Surveys Morris H. Hansen 
: and Joseph Steinberg 
The Study of Physiological Effects of Hot 
Climates J. O. Irwin 
‘Confidence Limits for Measuring the Precision 


of Bioassays C. I. Bliss 


D. J. Fi ; 
. J. Finney 
and F. W. Cope 
K. R. Nai 
air : 


INFORMATION FOR CONTRIBUTORS 


Biometrics, the journal of the Biometric Society, is published quarterly. Its 
general objects are to promote and to extend the use of mathematical and statistical 
methods in pure and applied biological sciences, by describing and exemplifying 
developments in these methods and their applications in a form readily assimilable 
by experimental scientists. It is also intended to provide a medium for exchange of 
ideas by experimenters and those concerned primarily with analysis. 

Original papers, and authoritative expository or review articles or critiques, will 
be accepted for publication if judged consistent with these general sims. Predomi- 
nantly analytical or methodological papers should contribute specifically to the 
formulation of quantitative hypotheses, to the interpretation of data, or to the plan- 
ning or analysis of experiments or surveys. Papers dealing with biological subjects 
should report conclusions of definite applicability reached by mathematical or sta- 
tistical analysis, so described as to facilitate possible use of the procedure in other 
fields of biology or related sciences. 

Technical notes or problems for consideration under the heading of QumRIms are 
invited. 


Contributions for Biometrics may be addressed to Dr. J. W. Hopkins, National 
Research Council, Ottawa 2, Canada; authors residing in the following Society 
Regions can expedite consideration of papers by submitting them to the appropriate 
Associate Editor, namely: British Reaion: until 31 December 1956, Dr. D. J. 
Finney, Department of Statistics, University of Aberdeen, Aberdeen, Scotland; 
effective 1 January 1957, Dr. S. C. Pearce, East Malling Research Station, East 
Malling, Maidstone, Kent, England. AvustrraLastan Reaion: Dr. E. A. Cornish, 
University of Adelaide, Adelaide, Australia; Frencn Recion: Dr. Georges Teissier, 
Faculté des Sciences de Paris, 1 rue V. Cousin, Paris, France. QuErres and related 
correspondence should be directed to professor G. W. Snedecor, Statistical Labora- 
tory, Iowa State College, Ames, Iowa, U. S. A. 


Manuscripts should be submitted in triplicate, with typescript doublespaced 
throughout. Marginal notes may obviate typographical difficulties presented by 
complicated formulae or tables. Tasups should be identified by arabic number and 
by a short descriptive title. I.ntusrrations should also be identified by arabic 
number and by a brief caption. (Captions should not be included in illustrations, but 
should be typewritten collectively on an accompanying sheet.) Originals should be 
approximately 8.5 x 11 in. (21.5 x 28 em.).- The original of each chari, diagram or 
graph should be executed in black on white drawing paper or board, on blue tracing 
linen, or on coordinate paper ruled in blue only; coordinate lines to be reproduced 
should be ruled in black. For printing, illustrations may be reduced to 4 or \% 
original dimensions. Lines should therefore be of sufficient thickness, and decimal 
points, periods, and stippled dots should be solid black circles large enough to repro- 
duce well. Lettering and numerals should be at least 1 mm. high when reproduced in 
a cut 3 in. (7.5cm.) wide. Photographs should be prints on glossy paper with strong 
contrasts, and if grouped in a plate should be mounted contiguously. All tables and 
illustrations should be mentioned explicitly in the text. RereRENCES (BIBLIO- 
@Rapuic) should be collectively listed alphabetically by author; textual citation by 
author and year is preferred. 


| 
i 


aa 
a 


— 
24 
a 
4 
: 
= 
: 3 
: 
° 
: 
4 
i 
al 
‘Loge 
‘a 


The Biometrie Society 


ITOMETERICS 


FouUNDED BY THE BIOMETRICS SECTION OF THE AMERICAN STATISTICAL ASSOCIATION 


TABLE or CoNTENTS 


The Statistical Analysis of a Complex Experiment Involving 
Unintentional Constraints . . D.J.FinneyandF.W.Cope 345 
Simplified Analysis of Singly Linked Blocks ... . K.R. Nair * 369 
A Raakit Analysis of Paired Comparisons for Measuring the 
Effect of Sprays on Flavor 
C. I. Bliss, M. L. Greenwood and E.S. White 381 
Theoretical and Experimental Study of Self Fertilized 
‘Populations ...... T. W. Horner and C. R. Weber 404 
The Discrimination of Interactions and Linkage in Continuous 
Applications of the k, Statistic to Genetic Variance Component 
Note on Wald’s Method of Fitting a Straight Line When Both 
Variables are Subject to Error... . . E.S. Keeping 445 
Correction to “The Use of Mitscherlich’s Regression Law in the 
Analysis of Experiments with Fertilizers” 
Frederico Pimentel Gomes 448 
Technical Proceedings of the International Biometric Symposium on 
“The Role of Biometric Techniques in Biological Research”’: 
Recent Advances in Biometry in Japan 
M. Masuyama and M. Hatamura 449 
Control of Errors in Surveys 
i Morris H. Hansen and Joseph Steinberg 462 
' The Study of Physiological Effects of Hot Climates 


J. O. Irwin 475 
Confidence Limits for Measuring the Precision of 


- Queries 


Contents of and Index to Volume 12 
Number 4 


December 1956 ‘Volume 12 


| “ah 
Se 
4 
BioassayS 1. Bliss 491 


THE BIOMETRIC SOCIETY 
General Officers 
President, E. A. Cornish; Secretary, M. J. R. Healy; Treasurer, C. I. Bliss; 
Council, F. J. Anscombe, Claudio Barigozzi, W. G. Cochran, G. M. Cox, Georges 
Darmois, B. B. Day, D. J. Finney, J. H. Gaddum, M.-P. Geppert, Americo Grosz- 
mann, P. C. Mahalanobis, Donald Mainland, Leopold Martin, Motosaburo Masu- 


yama, P. A. P. Moran, Jerzy Neyman, C. R. Rao, E. J. Williams, Frank Yates, 
W. J. Youden. 


Regional Officers 

Eastern North American Region: Regional President, D. B. Duncan; Secretary- 
Treasurer, A. M. Dutton. British Region: Regional President, D. J. Finney; Secre- 
tary, E. C. Fieller; Treasurer, A. R. G. Owen. Western North American Region: 
Regional President, D. G. Chapman; Secretary-Treasurer, Elizabeth Vaughan. Aus- 
tralasian Region: Regional President, E. J. Williams; Secretary, G. S. Watson; T'reas- 
urer, G. A. McIntyre. French Region: Regional President, Eugene Morice; Secretary- 
Treasurer, Daniel Schwartz. Belgian Region: Regional President, Desire DeMeule- 
meester; Secretary, Leopold Martin; Treasurer, A. H. L. Rotti. Italian Region: 
Regional President, Gustavo Barbensi; Secretary, L. L. Cavalli-Sforza; Treasurer, 
R. E. Scossiroli. German Region: Regional President, Egon Ullrich; Secretary- 
Treasurer, Wilhelm Ludwig. 


National Secretaries 


Denmark, N. F. Gjeddebaek; India, K. Kishen; Japan, M. Hatamura; The 
Netherlands, E. van der Laan; Sweden, H. A. O. Wold; Switzerland, H. L. LeRoy. 


Editorial Board 
Biometrics 
Editor: J. W. Hopkins; Editorial Associates and Committee Members: C. I. Bliss, 
Irwin Bross, E. A. Cornish, W. J. Dixon, Mary Elveback, Ralph Bradley, D. J. 


Finney, S. Lee Crump, Leopold Martin, Horace W. Norton, O. Kempthorne, G. W. 
Snedecor and Georges Teissier. Managing Editor: J. W. Hopkins. 


The Biometric Society is an international society devoted to the mathematical 
and statistical aspects of biology. Biologists, mathematicians, statisticians and 
others interested in its objectives are invited to become members. Through its 
regional organizations the Society sponsors regional and local meetings. National 
secretaries serve the interests of members in Denmark, India, Japan, the Netherlands, 
Sweden and Switzerland, and there are many members at large. 

Rates (in U.S.A. currency) for full membership in the Society for 1956, including dues and a sub- 
scription to this journal are: for residents of Canada and the United States $7.00; for others $4.50. 
Members of the American Statistical Association who are currently subscribing to Biometrics through 
that organization may become members of the Biometric Society on payment of $3.00 annual dues 
if resident in the United States or Canada, and of $1.75 annual dues if resident elsewhere. Information 
concerning the Society can be obtained from its Secretary, M. J. R. Healy, Statistics Department, 
Rothamsted Experimental Station, Harpenden, Herts., England. 

Annual subscription for non-members of the Biometric Society is $7.00, payable to the Managing 
Editor, Biometrics, National Research Council, Ottawa 2, Canada. Members of the American Sta- 
tistical Association may subscribe through the Secretary of the American Statistical Association at 
$4.00 per annum. 


Second-class mailing privileges authorized at New Haven, Conn. Additional 
entry at Richmond, Va. Business Office: until 31 December 1956, 360 Prospect 
St., New Haven, Conn. (formerly 52 Hillhouse Ave.); effective 1 January 1957, 
509 West Hill Road, Knoxville 19, Tennessee. Biometrics is published quarterly— 
In March, June, September and December. 


ii 


7 

a 


THE STATISTICAL ANALYSIS OF A COMPLEX EXPERIMENT 
INVOLVING UNINTENTIONAL CONSTRAINTS 


D. J. Finney 
A.R.C. Unit of Statistics and Department of Statistics, 
University of Aberdeen, Scotland 


AND 
F. W. Core 
Imperial College of Tropical Agriculture, Trinidad 


1. INTRODUCTION 


The analysis of variance is now widely known, and so thoroughly 
explained in many text books that it can be used regularly by large 
numbers of research workers who have had no special training in sta- 
tistics. Nevertheless, various techniques useful in special circumstances 
are much less familiar, though perhaps obvious to a mathematical 
statistician. Experiments that do not go exactly according to plan, 
either because of flaws in design or because of unsuspected trends and 
sources of variation, often provide illustration of the value of several 
such techniques in a single analysis. 

This paper discusses the analysis of yields from a complex factorial 
experiment on cacao. Before planting of the experimental area had 
been completed, a flaw in the design was discovered. Advantage was 
taken of this discovery to modify both the treatments and the design; 
certain constraints of the original design had already been introduced, 
however, and in any final analysis of the experiment account ought to be 
taken of these. Even without this complication, the statistical analysis 
of the experiment presents interesting features, and here these will be 
illustrated with the aid of the yields of cacao in 1954-55. This paper is 
not intended as a definitive account of the interpretation of the experi- 
ment, which, since cacao is an orchard crop, must continue for many 


years before adequate evaluation of the experimental treatments is 
possible. 


Extent of computations 


The reader may perhaps be intimidated by the amount of heavy 
computation involved. The experiment is large and complicated, so 
that the statistical analysis is inevitably laborious. However, he should 
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remember that such extensive computation is not always necessary in 
a complex experiment. But for the flaw in the original design, only the 
calculations of Sections 3 and 4 would have been required; moreover, as 
shown in Section 4, the rather tedious evaluation of many different 
standard errors can be replaced by a simple and rapidly obtained 
approximation that is good enough for most purposes. Although for 
completeness the formation of summary tables has been fully described, 
in practice not all of these might be wanted. Once the method of 
analysis has been outlined, it can be applied to yields from successive 
vears or sequences of years without difficulty, since the formulae for 
the various quantities remain unaltered and only numerical values 
have to be changed. Two or three days of computation annually, to 
a routine pattern, does not seem extravagant for an experiment of this 
size. 

Section 5 contains the most laborious calculations, but these are 
the price that must be paid for the non-orthogonality of design. An 
experienced statistician would perhaps have guessed from inspection 
of the yields and a few trial calculations that, for the 1954-55 crop, 
the complications of Section 5 could be ignored without serious harm, 
although it is difficult to be sure of this without the full analysis as 
given. The greater part of the work of Section 5, the inversion of a 
matrix, has to be done only once and is then available (Table 8) for 
any set of yields from this experiment that may require analysis in the 
future. The calculations described in Seetion- 5 took one of us many 
hours, but this phase in the analysis of yields from another year of the 
experiment might not need more than one hour. 


The experiment 


In 1949, it was agreed that the Soil Science and Chemistry Section 
of the Cocoa Research Scheme (which is administered by the Imperial 
College of Tropical Agriculture) should lay down an experiment at 
River Estate, Trinidad, in order to investigate the effects of trenching 
(the burying of waste vegetation in shallow trenches) on the growth 
and bearing of cacao trees, in comparison with surface applications of 
the same materials or of bagasse (cellulosic waste from sugar mills). 
The underlying idea was to attempt to find the most expedient method 
of réinstating the normal organic profile in cacao soils; considerable 
evidence is adducible that the gradual decline in productivity of Trinidad 
cacao-producing soils is largely due to the loss of organic matter, coupled 
with a deterioration in the biological condition of the soil consequent 
upon diminution in the supply of leaf litter (Havord [1951]; Havord 
et al., [1953, 1954]). 
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Study of any interaction between spacing of trees and soil ameliora- 
tion by the trenching and mulching treatments seemed desirable. 
Furthermore, in order to increase the generality of any recommendations 
arising from the experiment, the inclusion of four clones of cacao (propa- 
gated as cuttings) was agreed. 


2. THE DESIGN 
The original plaid square 


When the experiment was first designed (by one of us (F.W.C.) 
and Dr. K. S. Dodds), the Soil Science and Chemistry Section was 
thought to intend including the trenching comparison on a factorial 
basis, in combination with the two mulching materials, bagasse and 


cut bush. Hence the designers contemplated a 2° X 4 factorial design 
on the factors: 


S: Spacing (S, = 7.2 ft square, S: = 9 ft square); 
M: Mulch (M,; = bagasse, J/; = cut bush); 

T: Trenching (7: = untrenched, 7; = trenched); 
V: Clone (Vi, V2 ’ V3 ’ V,). 


It was then regarded as essential that the mulched plots should run in 
strips, in order to facilitate applications of mulches several times a year. 
The spacing treatments were also to be arranged in strips, so that border 


APPLIED TO WHOLE CoLUMNS 


M2 M2 M M M2 M M M2 
S2 ViT2 VaTi V2T2 wn 
Si VeT1 ViT2 Vili V2T 2 
Si V2T2 VaT2 Mit | ViTs 
Si VaT2 Viti Vals ViT2 VsT1 ViT2 
S2 Viti Wit ViTx ViT2 Val: 
FIGURE 1 


Original (1949) design. For explanation of symbols, see text. 
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effects would be minimized. An 8 X 8 plaid square design was therefore 
proposed, in which M/ and certain interactions would be confounded 
with columns, S and other interactions confounded with rows. 

The design (Fig. 1) was constructed by taking a plan appropriate 
to an 8 X 8 plaid square, assigning four randomly selected columns to 
M, and the others to M, , also assigning four randomly selected rows 
to S, and the others to S, , and then randomizing the arrangement. 
Planting was begun in 1949 and completed in 1950. During the second 
season’s planting, the non-orthogonality of the field plan was discovered; 
in particular, it contains three plots of each of the combinations 
S,M,T,V, , 8.M.T,V; , S.M,T.V, , S:M.T.V. and only one each of 
S,M,T.V, , S,\M.T,V; , S.M,T,V; , S.M.T.V; . How the basic plan 
evolved into a non-orthogonal design remains to this day a mystery. 


The final partially confounded design 


Up to this time, mulching and trenching had not begun, so that only 
factors S and V had been introduced. One possibility then was to 
transplant a sufficient number of plots so as to produce a correct design 
by interchange-of>ctones. Though this was physically possible, the 
authors had little hesitation in advising against a procedure that would 
have introduced an unpredictable and possibly heavy mortality on a 
few plots; replacement of dead plants would have produced plots of 
mixed ages, since the original cuttings were then in their second year, 
and serious biases- ia eventual comparisons between treatments might 
have resulted. 

The alternative adopted was to ‘degrade’ the plaid square into a 
series of eight randomized blocks formed by the rows, with S confounded 
between blocks. If the requirement that strips of eight plots should 
have the same mulching treatment were abandoned, the remaining 
treatments could then be introduced so as to confound unimportant 
interactions. By chanee, the top and bottom halves of the experiment 
contained equal numbers of S, and S, rows and so could be regarded as 
replicates. A relic of the original design would remain, in the constraint 
that each clone would appear twice in each column, and special steps 
might later be needed in order to take account of this. 

At this stage, the Soil Science and Chemistry Section stated a 
preference for a non-factorial set of soil treatments instead of M and T. 
The practical difficulties of digging a trench large enough to bury the 
amounts of mulch material applied annually to the surface suggested 
replacement of this form of trenching by a practice previously found 
beneficial on this estate. This involved opening shallow trenches only 
every fourth year, burying any organic materials at hand, and covering. 
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Block: 
yield total 
ViMi VeMe V3Ms3 VaMi 
425 551 611 1035 1444 785 663 573 1: 6,087 
904 892 849 527 821 447 545 1720 II: 6,705 
ViMs VaMe Vem. ViM2 
663 642 846 623 694 976 1149 1095 III:6,688 
V3sM2 Vim V3M3 ViMs V2M2 
1087 1333 1061 904 810 606 1267 1976 IV: 9,044 
ViMe Vim ViMs V2M3 V3Me2 
717 1198 653 1154 1108 1283 776 1144 V:8,033 
VaMs | | Vid | ViMe | | VaMe | Ville | Veh @ 
761 1482 795 1494 1391 1027 668 1071 VI: 8,689 
ViMa V2M: ViMe ViMs V3M3 
462 1134 1244 553 1004 960 1697 1209 VII: 8,263 
ViMs V3M2 ViM2 V2Ms ViMs 
1237 534 439 1734 1897 294 1532 453 | VIII: 8,120 
FIGURE 2 


Revised (1950) design 


The arrangement finally adopted was that shown in Fig. 2; it has a 
2 X 4” set of factors, where the symbols indicate 


V: Clone (V;, V2, Vs, Vs = cuttings of ICS 1, 6, 8, 60 respectively). 


As far as was practicable, chemical equivalence of M, , M, , M; was 
aimed at, and the organic plant materials of M, are therefore strongly 
fortified with pen manure. 

The experiment now consists of two replicates of the 32 treatment 
combinations, in eight blocks of eight plots. The effect of spacing and 
certain components of the VM and SVM interactions are confounded. 
The scheme of confounding can best be explained after a formal replace- 
ment of V and M by pairs of quasi-factors each at two levels; thus the 
four clones are made to correspond to the four combinations of two 
factors A and B and the four types of mulching to the four combinations 


S: Spacing (S; = 7.2 ft square, S. = 9 ft square) 
M: Mulch (M, = bagasse mulch, annually; 


and yields of dry cacao in 1954-55, in lb. per acre. For 
explanation of symbols, see text. 


M; = cut bush mulch, annually; 


M; = burial of organic matter in,trenches, every fourth year; 


M, = no treatment) 
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of two factors C and D. The experiment can now be regarded as a 2° in 
two replicates, where the correspondence of main effects is 


S = S, — S, (for convenience, since S; shows the higher yields), 


B=-V; +V3; + Vi, 
AB = V3 + 
C= 
D= -M, 
CD= 


If the treatments on the 64 plots are rewritten in terms of the factors 
S, A, B, C, D, it becomes apparent that S, BD, and SBD are con- 
founded in the first replicate and S, ABCD, and SABCD are confounded 
in the second. In fact, five mutually orthogonal block contrasts can 
be identified as follows: 


The contrast of (II + IV + V + VI) with (I + III + VII + VIII) estimates S, 
the contrast of (I + IV) with (II + III) estimates ABCD, 

the contrast of (III + IV) with (I + II) estimates SABCD, 

the contrast of (VI + VII) with (V + VIII) estimates BD, 

the contrast of (VI + VIII) with (V + VII) estimates SBD. 


There are, of course, two further contrasts between blocks mutually 
orthogonal and orthogonal with these five, so completing the set of 7 
d.f. These can be identified with the difference between replicates 
and the interaction of S with replicates. 


Plot size 


The gross plot size is 36 ft X 36 ft. Each plot of S, carries 16 
experimental trees, and each plot of S, carries 9. There are guard 
rows (JCS 1) between plots, those within blocks having trees at the 
same spacing as the experimental trees of the blocks and those between 
blocks having a spacing that conforms to the block above it in Fig. 2. 
Thus the net plot sizes, for experimental trees only, are 28.8 ft X 28.8 
ft for S, , 27 ft X 27 ft for S, . 


3. ANALYSIS OF VARIANCE 


Orthogonal contrasts 


Fig. 2 also shows yields per plot for 1954-55, expressed as lb. of dry 
cacao per acre, of course based upon calculations from net plot sizes. 
Naturally at this early date no adequate analysis and interpretation of 
yields is possible, but a consideration of the technique of analysis is of 
interest as an illustration of statistical methodology. In the first 
instance, the column constraints residual from the earlier design will 
be ignored and the experiment regarded simply as a 2 X 4° factorial 
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TABLE 1 
CALCULATION OF ORTHOGONAL CONTRASTS 
Treatment Symbol Sum of Total Diff. of Total Contrast 
replicates replicates 
—‘}’) 

(i) (ii) (iii) (iv) (v) (vi) (vii) 
S.ViM, 1 978 61629 128 4581 | Sum 
Si 8 1699 3313 — 109 —2635 | S 
SVM, a 1828 9691 440 1247 | A 
Si sa 1920 2115 222 —1621 | SA 
SoV3M, b 1629 3365 — 561 3337 | B 

1 sb 1100 — 2263 206 —3695 | SB 
S.ViM, ab 1810 —- 85 664 475 | AB 
Si sab 2421 3651 — 113 687 | SAB 
S.ViM2 c 2109 2067 — 189 —2261 | C 
Si sc 1572 1559 — 236 — 493 SC 
S.V2M, ac 2688 — 2235 — 200 —4415 | AC 
Si sac 3458 3405 — 494 2309 | SAC 
S.V3M2 be 2345 1491 1123 2527 BC 
Si sbe 2109 1927 — 557 —2529 | SBC 
S.V4M2 abc 2743 — 219 1051 — 379 | ABC 
Si sabe 2918 — 1843 — 522 1441 | SABC 
S.ViM; d 1095 — 5025 — 189 2875 | D 
Si sd 1965 1179 323 5483 | SD 
S2V2M3 ad 2567 — 2799 497 — 1239 AD 
Si sad 2370 — 2343 196 1709 | SAD 
S.V3M; bd 1872 1719* 546 — 121* | BD 
S, sbd 1571 — 213* — 49 1239* | SBD 
S.VsM3 abd 2673 1569 721 — 515 | ABD 
Si sabd 2283 1905 499 3105 SABD 
S.ViM, cd 990 —11047 — 112 — 459 CD 
S sed 1714 3005 502 5765 | SCD 
S.V2M, acd 917 — 4433 — 329 — 705 ACD 

1 sacd 1262 991 172 — 461 SACD 
S.V3M, bed 1667 1985 341 — 2871 BCD 
S: shed 1554 1053 500 4185 | SBCD 
S.VsM; abcd 1247 3159* — 323 — 317* | ABCD 
Si sabed 2555 3487* 433 —2393*-| SABCD 


*Involved in partial confounding; see text. 


ee: 
digg 
“AS 
4 


352 BIOMETRICS, DECEMBER 1956 


with partial confounding; further consideration of these constraints 
is deferred to §5. The first stage in the analysis is the forma- 
tion of the analysis of variance. This is most readily achieved by use 
of the formal representation of the design as a 2° with the factors 
S, A, B, C, D, followed by re-interpretation in terms of the true factors. 

Table 1 summarizes the calculation of orthogonal contrasts between 
the 64 yields for deriving main effects, interactions, and components of 
error by Yates’s method of systematic additions and subtractions 
(Yates [1937]). Columns (iii) and (iv) of this table are obtained by 
adding, from the two replicates, yields of pairs of plots with the same 
treatment and then performing the five successive series of additions 
and subtractions required by the Yates procedure; only the first and 
final columns of this calculation are shown. In columns (v) and (vi), 
differences between pairs of similarly treated plots are analysed in the 
same way. Thus the two ‘totals’ columns contain the treatment 
contrasts and the interactions of these with replicates. Up to this 
stage, no account has been taken of the confounding. 


Sums of squares 


The analysis of variance, Table 2, can now be completed, the only 
complication being that of the partially confounded contrasts. For 
example, the component for replications is the square for ‘sum’ in column 
(vi) of Table 1: 


So the sum of squares for clones is formed from A, B, and AB in column 
(iv) as 


(9691° + 3365° + 85°) + 64 = 1,644,468; 


the unconfounded 7 d.f. for the VM interaction have a sum of squares 
formed similarly from AC, BC, ABC, AD, ABD, ACD, BCD. 

The totally confounded contrast for spacing requires no modification 
from the value in Table 1, and the square, 


1 2 
— 171,500, 
appears in the inter-block section of Table 2. The partially confounded 
contrasts require special calculation, since each has an intra-block 
portion calculated from one replicate and an inter-block portion from 


j 
2 
4581 
——— = 327,899. 
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i TABLE 2 
i ANALYSIS OF VARIANCE 
Variation due to D.f. Sum of squares Mean square 
Replications 1 327 ,899 
S 1 171,500 
VM 2 114,345 
SVM 2 278 ,337 
Error (inter-block) 1 108 ,488 
Inter-block 7 1,000, 569 
V 3 1,644, 468 548, 156 
SV 3 358,191 119,397 
M 3 2,368,114 789 ,371 
SM 3 200 , 790 66 , 930 
VM (unconfounded) 7 643 , 033 | 81,398 
(intra-block BD and 2 89,551 / 
ABCD) | 
SVM (unconfounded) 7 467 , 400 54,802 
(intra-block SBD and 2 25,821 
SABCD) 
Error (intra-block) 26 2,928 ,948 112,652 
Total 63 9,726,885 


the other. These could be formed by new computations directly from 
plot yields, but are more readily obtained from Table 1 by adding and 
subtracting corresponding entries from columns (iv) and (vi). For 
example, BD is confounded in the second replicate; therefore, its intra- 
block contrast, computable from the first replicate only, is 


[1719 — (—121)] + 2 = 920, 
and its inter-block contrast is 
[1719 + (—121)] + 2 = 799. 


Similarly for ABCD, which is confounded in the first replicate, the 
intra-block contrast is 


| [3159 + (—317)] + 2 = 1421 
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and the inter-block is 

[3159 — (—317)] + 2 = 1738. 
Hence the intra-block sum of squares for BD and ABCD is 

(920° + 1421”) + 32 = 89,551, 


and the inter-block sum of squares is 
(799" + 1738") + 32 = 114,345. 


The sum of squares for error can be computed from the totals in 
column (vi) of Table 1, by summation of the squares for all the con- 
trasts except those for replicates, S X replicates, and the four involved 
in the partial confounding. The sum of the five inter-block components 
in Table 2 can then be checked against a sum of squares formed directly 
from the eight block totals, and the sum of squares for the whole table 
can be checked against a total computed from the individual plot yields. 


Tests of significance 


For preliminary tests of significance, mean squares for main effects 
and interactions in Table 2 can be compared with appropriate error 
mean squares. The intra-block portion of BD and ABCD of course 
involves the same error as the remaining 7 degrees of freedom for this 
interaction, and only a single mean square need be shown for VM. In 
the intra-block section of the analysis, the mean squares for clones and 
for mulches are significantly greater than the error mean square. None 
of the interaction mean squares is statistically significant by comparison 
with the error, but, since some of them are based on several degrees 
of freedom, they will be worth closer examination wher tables of means 
are drawn up (§4). The inter-block section of the analysis is chiefly 
of interest for the information it might give on spacing. No test of 
significance is much use, since only 1 degree of freedom for the true 
error is available and even pooling interactions with this adds only 4 
degrees of freedom. However, the mean square for spacing is not much 
larger than either the inter-block error or a mean square formed by 
inclusion of the interactions; these two are almost the same as the 
intra-block error, so suggesting that there is very little additional error 
between blocks. Although no useful standard error for the difference 
between spacings can be quoted, any such difference is evidently too 
small to have been proved significant by an experiment of this size 
even if the contrast were unconfounded, despite the fact (noted below) 


that the difference is estimated by the experiment as about ten per cent 
of the mean vield. 
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4. SUMMARY TABLES 
Rules for construction 


In any factorial experiment, it is usually desirable so to tabulate 
mean yields as to show all two-factor interactions, even when these are 
not statistically significant. Even though an interaction is not large 
enough for a confident assertion that it differs from zero to be made on 
the evidence of the one experiment, the values may be wanted for 
comparison with results from other experiments. 

For the interactions of spacing with clones and of spacing with 
mulches, no problems of confounding arise, and mean yields can be 
computed by direct averaging of individual plot values. An alternative 
process that is rather quicker in a large experiment involves further 
applications of Yates’s scheme of additions and subtractions. The 
rule for obtaining mean yields for all combinations of any set of factors 
is as follows:— 


(i) Take from column (iv) of Table 1 the values for the sum and all main 
effects and interactions of the factors to be studied; write these in a column 
in the reverse of Yates’s standard order. 

(ii) Perform additions and subtractions on these according to Yates’s scheme, 
starting from the top of the column in the ordinary manner. 

(iii) Divide each entry in the final column by 64 (more generally, by the number 
of plots in the experiment). 

(iv) Read the results as the mean yields for the combinations of the factors in 
the order in which these combinations are introduced in Table 1 but reading 
from the foot of the column. 


Thus, in order to obtain the means for the four clones, the means for 
all combinations of factors A, B are required. The grand total and the 
effects A, B, AB are written in a column in the reverse of the standard 
order and the Yates calculations completed:— 


Contrast Total (i) (ii) +64 Mean 
AB —85 3280 74600 1166 ab, or V4 
B 3365 71320 55388 865 b, or V3 
A 9691 3450 68040 1063 a, or V2 
Sum 61629 51938 48488 758 (1), or Vi 


Yates (1937) described this process also, but it appears to be less widely 
known than his process for forming the contrasts. Division by 64 
then gives the mean yields for V, , V. , V3; , Vs. In this way, the 
entries in Tables 3 and 4 have been constructed. The standard errors 
for these tables are based entirely on the intra-block error mean square, 
and are applicable to comparisons of the main effects of V and M 
(16-fold replication) or to the interactions (8-fold replication). As 
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has already been seen, no useful estimate of inter-block error is available, 
so that comparisons between wide and narrow spacing cannot be 
referred to any standard error; there is some indication of an appreciable 
(ten per cent) improvement in yield at the narrower spacing, but little 


TABLE 3 


MEAN YIELDS FOR CLONES AND SPACING 
(Ib. dry cacao per acre) 


. ICS clone Mean 
Spacing 1 6 8 60 
869 1126 792 1272 1015 
9’ 646 1000 939 1059 911 
Mean (+84) 758 1063 865 1166 963 


S.E. for interactions: +119 lb. per acre 
There is no estimate of S.E. for vertical differences. 


TABLE 4 


MEAN YIELDS FOR MULCHES AND SPACING 
(lb. dry cacao per acre) 


| 
Mulch 
Spacing | Bagasse Cut bush Trenching None Mean 
7.2 | 892 1257 1024 886 1015 
9’ | 781 1236 1026 603 - 911 
Mean (481) 837 «12461025 744 963 


S.E. for interactions: +119 lb. per acre 
There is no estimate of S.E. for vertical differences. 


confidence can be placed in this. Evidently clones V, and V, (ICS 6 
and ICS 60) have yielded substantially better than the other two, 
differences within either pair being less noticeable. Mulching with cut 
bush was the most successful of the four treatments, its superiority 
to trenched material just failing to reach statistical significance (ef. §5); 
mulching with bagasse, on the-other hand, was not significantly better 
than no treatment. The sums of squares for the two interactions. with 
spacing were not sufficiently large to give statistical significance even 
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if either of these were concentrated in a single degree of freedom, so that 
not surprisingly no striking interactions are displayed in Tables 2 and 3. 
There is some indication that V,; (ICS 8) may be relatively more suc- 
cessful at wide spacing than are the other clones. 


A partially confounded interaction 


Perhaps the most interesting interaction is the VM. In order to 
prepare a table to display this, each partially confounded component. 
of interaction must be adjusted to a value based on the replicate in 
which it is unconfounded. The calculations (Table 5) may be per- 
formed as described above. However, the value for the BD contrast 
in Table 1, namely 1719, must be replaced by the value based upon the 
second replicate alone; this was shown in §3 to be 920, which figure 
must be doubled for insertion in Table 5 so as to be comparable with 


TABLE 5 
CaLcuLATION oF VM TaBLe ApsusTED FoR PARTIAL ConFouNDING 


Contrast and 


magnitude (i) (ii) (iii) (iv) +64 Treatment 
ABCD [2842] 4827 —10653 —15068 60636 947 ViM, 
BCD 1985 —15480 — 4415 75704 51934 811 V3M, 
ACD — 4433 3409 1104 — 9466 35060 548 VM, 
cD —11047 — 7824 74600 61400 42786 669 ViM, 
ABD 1569 1272 — 7471 -—31540 79734 1246 V.M; 
BD [1840] — 168 — 1995 66600 54852 857 ViM; 
AD — 2799 3280 6012 — 8294 78554 1227 V:M; 
D — 5025 71320 55388 51080 49116 767 ViMs 
ABC — 219 — 857 —20307 6238 90772 1418 ViM; 
BC 1491 — 6614 —11233 73496 70866 1107 V3M: 
AC — 2235 271 — 1440 5476 98140 1533 VM; 
Cc 2067 — 2266 68040 49376 59374 928 ViM; 
AB 8 1710 — 5757 9074 67258 1051 ViM, 
B 3365 4302. — 2537 69480 43900 686 V:M, 
A 9691 3450 2592 3220 60406 944 VM, 
Sum 61629 51938 48488 45896 42676 667 Vil, 


other contrasts based on 64 plots. The value for ABCD is similarly 
modified. The additions and subtractions then automatically lead 
to the adjusted mean yields in the last column of Table 5. This process 
is here seen to particular advantage, since it removes all need for the 
worry about the signs of adjustments to means that occurs in the more 
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usual process of direct calculation of averages from yields of individual 
plots. : 


The average variance 


In Table 6, these means have been rearranged in an interaction 
table of conventional pattern. .At this point, a direct check that the 
interaction components BD-and ABCD have the values 58 and 89 


TABLE 6 
Mean YIELDS FOR CLONES AND MuLcHEs, ADJUSTED FOR CONFOUNDING ; 
(Ib. dry cacao per acre) 


Mulch ICS clone Mean (+84) 
| 6 8 60 
Bagasse 667 944 686 1051 837 
Cut bush 928 1533 1107 1418 1246 
Trenching 767 1227 857 1246 1024 
None 669 «548 811 947 744 
Mean (+84) 758 1063 865 1166 963 


Average S.E. for body of table: +186 lb. per acre 


respectively, as computed on a single plot basis from the totals in §3, 
is useful. The standard errors of the margins of this table are as in 
Tables 3 and 4. For components of interaction, the standard error 
depends upon what combination of unconfounded and partially con- 
founded contrasts is involved. A rough idea of precision can be ob- 
tained by assigning to entries in the body of the table an average 
standard error. This is obtained by noting that the interaction has 9 
degrees of freedom, of which 7 are unconfounded and 2 are unconfounded 
only in one replicate; hence an average variance is 


where s° is the intra-block error from Table 2. 

This average standard error can be used in a rough test of any 
component of interaction. For example, inspection of Table 6 suggests 
that one feature of the difference between the two better clones is that 


' V, responds well to bagasse whereas V, does not. In the absence of other 


reasons for subjecting this point to special examination, to single it out 
for separate test might introduce a bias into the test of significance. 
That is not to say that to make such an examination is wrong, for a 
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tentative inquiry into a contrast suggested solely by the internal 
evidence of an experiment may be the beginning of discoveries about 
previously unsuspected effects. Here this contrast is chosen only for 
illustrative purposes, and with the warning that any apparent statistical 
significance disclosed would need to be accepted with reservations on 
account of its arbitrary selection as the most notable feature of Table 6. 

Define now the symbol V,M; to represent the mean yield of clone i 
on treatment j, one of the quantities tabulated in the body of Table 6. 
The comparison that is of interest, denoted by H, is 


H = 3(V.M, ViM, + V.M,) 
= 146. 


From the average variance of the mean yields, an approximate variance 
obtained immediately is 


V(H) = (3)* X 4 X 34421 = (186)’. 


Since the standard error is even larger than H, there is clearly no 
suggestion of statistical significance for the contrast. 


The exact variance 


Had the issue of this test been in any doubt, reliance on an average 
variance might have been undesirable. Calculation of an unbiased 
estimate of variance appropriate to H requires that H be expressed as 
a linear function of the nine interaction contrasts AC, BC, ABC, --- 
ABCD used in the earlier analysis. As will be seen below, the difference 
between the two variances is negligible, and for all practical purposes 
the average variance is good enough. Details of the exact calculation 
are presented as an example of a process that is tedious though not 
inherently difficult. 

When expressed in units of a single plot, the interaction contrast 
BC can be written formally as 


BC = — V.+ V3 + + Mz Ms + M,), 


where the contents of the brackets have been taken from the definitions 
of B and C in §2 and the symbols on the right are meaningful only 
after the product has been evaluated according to ordinary algebraic 
rules. The numerical value of this contrast can be taken from column 
(iv) of Table 1, now with a divisor of 32 to reduce the quantity to units 
of a single plot difference, and is therefore 1491 + 32. Similar expres- 
sions can be written for the other eight contrasts, but of course for BD 
and ABD the numerical values must be taken from one replicate only, 
so as to use only the intra-block estimate as in §2, 
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The unique representation of H as a sum of multiples of these nine 
contrasts can now be discovered by a process analogous to the computa- 
tion of regression coefficients. ‘The justification lies in the mutual 
orthogonality of the nine. The multiplier of any contrast is the sum 
of products of the coefficients of V;/; in H and in the contrast, divided 
by the sum of squares of these coefficients in the contrast. Since the 
formula for H given above involves only four of the V;M; , only these 
need special attention in the contrast. For example, the formal equation 
for BC can be re-written, by expansion of its right hand side, as 


BC = 3(V.M, — V.M, — V.M, + + etc). 
The rule then gives for the multiplier of BC 
+ + + + X 16] = 1, 


since the full expansion of BC has 16 terms. Similarly for AC the 
multiplier is 


+ (—D@ + (-2(-D + + X 16] = 0. 
Completion of these calculations* leads to 
H = BC + ABC + BD + ABD. 


As a numerical check, values of the contrasts may be inserted; reading 
from Table 1, or here more conveniently from Table 5, 


H = (1491 — 219 + 1840 + 1569)/32 
= 146 


as before. 


Now AC, on a single plot basis, is a difference between two means 
of 32 plots, and therefore has variance s’/16. Six of the other inter- 
action contrasts have the same variance, but BD and ABCD have 
only half this precision. Hence the variance of H is 


V(H) = des” + + + 
= 58/16 
= 35204 = (188)’, 


almost exactly the same as from the average variance. 


*The arithmetic can be completed almost more rapidly than it can be described. There are, of 
course, many other ways of obtaining the expression for HW, and in this rather simple example doubtless 


a quicker way can be found. The rule proposed has the merit of generality and can be applied auto- 
matically. 
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A second example 


As a slightly more complicated example, consider the comparison 
of the difference between the average treatment responses for V, and V, . 
In the same notation, and on a single plot basis, this may be written 


H + V.M, + V.M; 3V.M, 
V.M, V.M; + 3V.M,) 


= 198, 
with approximate variance 
V(H) = (8)° X 24 X 34421 = (151)’. 
The multiplier of BC in H is, by the rule, 


()@) + + + (-H(-D + + 
+ + + X 16] = 3. 
This method leads to 
H 


3(BC + ABC + BD + ABD + BCD + ABCD) 
= 198 as before. 


The unbiased estimate of variance is therefore 


V(H) = 8(4 X ves’ + 2 X 48°) 
= 28°/9 
= (158)”. 


The difference from the approximate value is rather greater than before, 
but the two agree in showing H to be only about 1.3 times its standard 
error. 


General conclusions 


Although the sum of squares for VM in Table 2 is large enough 
to contain one component significantly greater than the intra-block 
error, Table 8 does not suggest that any interaction has occurred. So 
far as the 1954-55 yields are concerned, the effects of mulching treat- 
ments show no strong evidence of varying from clone to clone. Table 2 
gives no reason to suppose that the three-factor interaction, SVM, is 
worth more detailed examination, but it could be studied in a similar 
manner if required. 
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5. THE COLUMN CONSTRAINTS 
Non-orthogonality of columns 


The analysis so far has ignored the history of the experiment and 
the earlier attempt to impose restrictions on the columns. As men- 
tioned in §2, this has resulted in a constraint that each clone appears 
twice in each of the eight columns of plots. If there are any consistent 
differences between columns in respect of yielding capacity, the error 
variance will have been overestimated because it includes positional 
effects. The simple adjustment of removing a sum of squares of devia- 
tions of column totals is not legitimate because column differences are 
not orthogonal with mulching treatments or with the various inter- 
actions. 


Multiple regression technique 


In such a situation, the general procedure of “fitting constants” for 
the parameters that relate to the extra constraints and solving the 
normal equations can conveniently be replaced by a multiple regression 
and covariance analysis; the two are essentially the same, but the regres- 
sion calculations may fit into a more familiar routine. Outhwaite and 
Rutherford [1955] have described this type of analysis, basing it upon 
the fitting of orthogonal polynomials to the trend of the columns. If 
the complete elimination of column differences is planned (and not 
merely the fitting of a linear or quadratic trend), and no particular 
interest attaches to the polynomials of successively higher degree, the 
coefficients of any set of contrasts between columns can be used as 
the independent variates, provided only that no one of the contrasts 
is determinable as a combination of the others. In practice, a set of 
mutually orthogonal contrasts is the most convenient, as it keeps the 
covariance terms small in the next stage of the analysis, and the greatest 
arithmetical simplicity is achieved by using a set in which all coeffi- 
cients are 1,0, or —1. Therefore a suitable set of independent variates 


is , , , Where each is constant within a column and the 
values are:— 
Column:— 1 2 3 4 i) 6 7 8 
x1 =-l1 1 0 0 0 0 0 0 
% 0 1 0 0 0 0 
7% =-l1 -1 1 1 0 0 0 0 
Zs = 0 0 0 0 0 0 -1 1 
= 0 0 0 -1 1 1 
=-l1 -1 -1 1 1 1 1 
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A covariance analysis of y on x, , 22, *-- , 47 can now be made 
according to the familiar rules. The first step is the examination of the 
error regression. This requires the error sums of squares and products 
of the z,; that correspond with the intra-block error sum of squares 
(26 d.f.) in Table 2. The process of Table 1 has been followed exactly 
to give columns corresponding to the “difference between replicates” 
section of that table for each z;. Sums of squares and products of the 
appropriate 26 totals have been formed and divided by 64; the results 
are recorded in Table 7. 


TABLE 7 
ERROR SuMS OF SQUARES AND Propucts (26 D.F.) FOR °** 
Each Entry Requires to be Divided by 8 


x 56 8 0 8 —20 —16 0 
87 —12 13 -7 2 
2X3 107 —10 7 9 74 
71 13 4 —12 
Zs 66 — 9 18 
Zs 127 14 
300 


Next is formed the inverse matrix of the entries in Table 7, sometimes 
termed the set of Gauss multipliers. This calculation will be familiar 
to all who have worked with multiple regression, and is well described 
in many text books (e.g. Fisher [1950], §29; Goulden [1952], §§8.6 and 
8.7). This inverse matrix is shown in Table 8, the entries in it being 


TABLE 8 
INvERSE Matrix OBTAINED FROM TABLE 7 


2X2 Z3 ‘2s Ze 
z, 0.189115 —0.032531 —0.007806 —0.041562 0.077960 0.027405 —0.005477 
0.104492 0.005174 0.029076 —0.037190 —0.000585 1.001449 
X3 0.091870 0.012727 —0.010533 —0.005205 —0.021312 
% 0.133459 —0.047930 —0.004289 0.005081 
Zs 0.168207 0.020042 —0.010099 
Zs 0.068428 —0.003280 
0.032876 
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determined by the conditions that the sum of products of values in any 
pair of corresponding rows of Tables 7 and 8 is unity and the sum of 
products for any other pair of rows is zero. Tables 7 and 8 depend 
upon the design of the experiment but not on the particular set of yields 
to be analysed; hence, having once been calculated, they can be used 
in analyses of yields or other observations relating to any subsequent 
years or combination of years. 

The error sums of products for x; with y in the analysis of covariance 
are obtained in the obvious manner: each of the appropriate 26 contrasts 
in column (vi) of Table 1 is multiplied by the corresponding quantity 
for x; , and the sum of these products is divided by 64. The results are:— 


— 191.75, 
2112.69, 
— 534.69, 
—1438.44, 
938.25, 
307.06, 
2555.38. 


The error regression coefficients of y on the x; are then formed as the 
sum of the products of these amounts with each row of Table 8 in turn, 
which operations give 


b, = 26.5326, 
= 151.0361, 
bs = —120.9422, 
bs = —162.6831, 
bs = 119.2241, 
be = 33.8960, 
br = —-81.7262. 


The sum of squares for column differences is the sum of products of 
the b; with the error sums of products of 2; , y: 


26.5326 & (—191.75) + --- + 81.7262 X 2555.38 = 943,793, 


from which follows the analysis in Table 9. The ratio of mean squares 
is not statistically significant, and in fact there is little indication that 
the columns really differ in their yielding capacities independently of 
the treatments applied. Rough examination of yields for each of two 
previous years also showed practically no sign of column effects, but 
there remains a possibility that positional differences may develop more 
strongly as the experiment continues. 
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TABLE 9 


ANALYsis OF Error Sum oF SQuaREs IN TABLE 2 
FoR oF CoLUMN DIFFERENCES 


Variation due to Dt. Sum of squares Mean square 
Columns 7 943 134,828 
Residual 19 1,985,155 104, 482 

Total 26 2,928,948 


Adjustment of mean yields 


The column differences are not sufficiently marked to affect greatly 
the precision of the experiment, and no full adjustment for them will 
be made here. That is not to say that they should be ignored for the 
future; when the experiment has continued for several years and an 
analysis of total yields for a period is made, elimination of column 
effects may become important to the precision. Indeed, since the con- 
straints on columns were deliberately introduced into the design, no 
analysis should be considered strictly correct without this elimination, 
although Table 9 makes clear that conclusions about the 1954-55 yields 
will be unaffected by it. Except for the very much more laborious 
calculations, the situation is the same as that of a Latin square analysis 
in which the mean square for columns is little different from that for 
error. Here, as an example, it will suffice to deal with main effects of 
V and M, for the column effects are almost certainly too small to affect 
seriously the conclusion that the VM interaction is at present unim- 
portant. The adjustments required are made exactly as in an ordinary 
covariance analysis. 

The design has secured that each clone occurs twice in each column, 
and therefore differences between clones in respect of any of the 2; 
are zero. Hence no adjustments to the mean yields of the four clones 
are required, and the only change is that their standard error should be 
computed from the residual mean square in Table 9; the standard 
error for clone means shown in Tables 3 and 6 should be changed from 
+84 lb. per acre to +81 lb. per acre. 

For mulch treatments, the process is more complicated. Perhaps 
the easiest form of calculation is to tabulate total contrasts for C, D, 
and CD in each of the x; , exactly analogous to those for yield in the 
first column of totals in Table 1. If a complete adjustment of all 
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results in respect of column differences were contemplated, construction 
of seven tables on the pattern of Table 1 would probably be the most 
convenient start. Here the three contrasts required have been picked 
out more directly and summarized in Table 10. Each yield contrast 


TABLE 10 
ADJUSTMENT OF CoNTRASTS FOR Factor M 


Variate Yield - 
Contrast 2 Zs Ze | Unadjusted Adjusted 
C 0 -2 2 2 -4 -6 -—4 2067 3944 
D 4 -4 2 -2 -8 8 — 5025 — 5826 
CD -4 2 6 4 2 -6 -12 — 11047 —8921 


is then adjusted to an estimate of what it would have been if the con- 
trasts in x; had been zero (i.e. perfect balance over columns) by sub- 
tracting the sum of products of each b; with the corresponding 2; 
contrast. Thus for C the adjusted value is 


2067 + 2b. 2b; 2b, + 4b; 6b, + 4b, 3944. 


From the adjusted contrasts, mean yields for the four treatments 
(Table 12) have been constructed as described in §4. 


Average variances 


The variance of a difference between any pair of these adjusted means 
depends upon which pair is to be examined, and in particular upon the 
corresponding differences in respect of the z; , as is usual in a covariance 
analysis. However, for most purposes it is sufficient to ascribe an aver- 
age variance or standard error to the means, by use of the formula 
given on p. 436 of Outhwaite and Rutherford [1955]. This involves 
forming from Table 10 the sums of squares and products for factor M 
(3 d.f.) in respect of the z; , which are shown in Table 11; thus for z, 
and x; the entry is 


X 2+ 4x (-—4) + 2 X 6] + 64 = —0.1250. 


The sum of products of corresponding entries in Tables 8 and 11 is 
0.8064 (remembering that each table requires to be completed by a 
bottom left corner symmetrical with the top right), which must be 
divided by the number of degrees of freedom for M and added to unity 
to give a variance adjustment factor of 1.2688. The variance of a mean 
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TABLE 11 
Sums or SquaREs AND Provucts For M (3 v.F.) FOR 21, 22, *** 21 
z, 0.5000 —0.3750 -—0.1250 —0.3750 0 0.8750 0.2500 
0.3750 -—0.1250, 0.1875 0.0625 -—0.5000 0.2500 
0.8750 0.3125 0.1875 -—0.2500 -—1.7500 
0.3750 —0.0625 -—0.8125 —0.6250 
Zs 0.3750 0.4375 —0.3750 
Xe 2.1250 0.5000 
3.5000 


yield for M, instead of being simply the residual mean square in Table 9 
divided by 16, is this quantity multiplied by 1.2688: 
10408? 1.2688 = (91)’. 
Thus the average standard error for comparisons amongst adjusted 
means is slightly greater than the approximate value in Tables 4 and 6 
that neglected to allow for column constraints. The increase is the 
price that must be paid for the non-orthogonality of the design, which 
here fails to be fully compensated by the reduction in the error mean 
square of Table 9 as compared with that of Table 2. Table 12 sum- 


TABLE 12 


Megan YIELDS FoR MuLcHES, ADJUSTED FOR COLUMNS 
(Ib. dry cacao per acre) : 


Bagasse Cut bush Trenching None Mean 


853 1255 950 794 963 


S.E.: +91 lb. per acre 


marizes the information on mulches, and shows that adjustment for 
columns has brought out more clearly the difference between cut bush 
and trenched material. 


6. SUMMARY 


A factorial experiment on cacao now in progress in Trinidad pre- 
sents several interesting features as an example of the design and 
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analysis of field experiments. Originally conceived as a plaid square 
for a 2° X 4 set of treatments, circumstances forced a change to partial 
confounding in randomized blocks for a 2 X 4° set. The change was 
made after certain factors had been introduced, and therefore some 
constraints from the earlier design remained. 

Yields from the experiment in 1954-55 are used in an illustration 
of the systematic analysis and summarizing needed for the partially 
confounded design, neglecting the additional constraints. The con- 
struction of tables of mean yields and the calculation of standard errors 
appropriate to various comparisons are emphasized. A multiple regres- 
sion technique is then used in order to take account of the other con- 
straints. 


| 
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Note added in proof 


The analysis of the yields of this experiment for the 1955-56 crop 
season reveals a column effect significant at the p .001 level. This 
abrupt change in the level of significance is explained by the consider- 
able fall in yield on good plots in columns 1 and 2 and the maintenance 


of good yields elsewhere in the experimental area, particularly in columns 
6, 7 and 8. 
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SIMPLIFIED ANALYSIS OF SINGLY LINKED BLOCKS 


K. R. Narr 


Forest Research Institute, Dehra Dun, U. P., India, and 
Institute of Statistics, The Consolidated University of North Carolina, North Carolina, 
At. 


INTRODUCTION 


The dual of a balanced incomplete block design having the param- 
eters v*, k*, r*, b*, X\* can, in the language of Youden [1951], be called 
“\*-linked blocks” since every pair of blocks of the dual design will 
have \* treatments in common. When d* = 1, the dual is called Singly 
Linked Blocks. Shrikhande [1952] and Roy [1954] independently 
showed that it is a p.b.i.b. design with two associate classes having the 
following values for the various parameters: 


* 
= — + 1); k=r*; r= k*; b= r*k* —r* +1 = 0* 


1 =1 =0 
m=rk-1) nm =(k—nr(r 1)(k - 1)/r 
— 2) + (r — 1)? (r — 1)(k — 7) 
irfk-r—1) 1) — kK 


Bose and Shimamoto [1952] divided all p.b.i.b. designs having two 
associate classes into five distinct types. Singly Linked Blocks was 
listed as one of these types. More recently, however, Bose, Clatworthy 
and Shrikhande [1954] have recast some of these types with the result 
that singly linked blocks has got submerged under what they call 
“Simple type’. 

In both these papers by Bose ef al, the analysis of p.b.i.b. designs 
having two associate classes has to some extent been simplified with 
the aid of four auxiliary parameters c, , c., Aand H. But this simplifi- 
cation does not naturally go far enough when a type or sub-type is 
separately considered. For instance, the analysis of Triangular Singly 
Linked Blocks given by Nair [1953] is much simpler than what it would 
have been by the use of the auxiliary parameters c, , c, , A and H devised 
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by Bose cf al. Similarly, the method of analysis of Square Lattices 
given by Cochran and Cox [1950] is the simplest possible for this sub- 
type (see Nair [1952]). 

The purpose of this paper is to give a similar simplified method of 
analysis for Singly Linked Blocks. This method will be applied to the 
illustrative example of a single linked blocks design worked out in 
Chapter III of the bulletin by Bose, Clatworthy and Shrikhande [1954], 
in order to demonstrate how much simpler it is compared to their method. 

This simplified method was developed by the author in the spring 
of 1952 while working at the Institute of Statistics, the Consolidated 
University of North Carolina, Raleigh during the tenure of a joint 
Fulbright and Smith-Mundt Fellowship awarded to him by the United 
States Government. However, due to lack of a suitable illustrative 
example at the time, its publication was deferred.* 


SIMPLIFIED ANALYSIS 


The general formula (37) for estimating the effect of a treatment 
with recovery of inter-block information, given by Nair’s [1952] paper 
can be simplified to yield the following expression for the adjusted 
treatment fofal with recovery of inter-block information: 


> {Qi} (1) 
where 


(w — w’)/[bw + (k — (2) 


Q.;) = Total yield B,;, of block (7) minus sum of the mean 
yields of the k treatments appearing in that block (3) 


T; = Unadjusted total yield for treatment 7. (4) 


and >>, ., in (1) denotes summation over the r values of Q,;, pertaining 
to the r blocks in which treatment ¢ occurs. 
It will be seen that 


(i) 


where ).;;),, denotes summation over the k values of 7; pertaining 
to the k treatments occurring in the (j7)th block. 

Dividing (1) by r we obtain the adjusted mean with recovery of 
inter-block information for treatment 1. 


*The author was aware that the experiment on oats by Dr. Middleton used by Bose et al. for 
illustration was laid out during 1951. The yield data were however not available before he left Raleigh 
in June 1952. 
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To obtain the corresponding intra-block estimates we have only 
to assume w’ = O ory = 1/b. 
For estimating the values of w and w’ from the data we have to 


perform the analysis of variance described below and presented in 
Table 1. 


TABLE 1 
ANALYsIS OF VARIANCE 
Source of Degrees of Sum of squares Mean 
variation freedom square 
r b 
Blocks (adj.) Mn E, 
7=1 
Treatments 
(unadj.) e-1 Ti - 
Intra-block 
vr — b-—v+1 (By subtraction) E, 
or 2 
Total or — 1 
or 
Blocks b 1 1 > B? G 
(unadj.) ke bk 
b 
(adj.) v 1 b + k Buy E, 


The total sum of squares, the treatment sum of squares (unadjusted 
for block effects) and the block sum of squares (unadjusted for treat- 
ment effects) are calculated in the usual straightforward way. 

It is much easier and simpler in the case of singly linked blocks 
first to calculate the sum of squares for blocks (adjusted for treatment 
effects) than to calculate the sum of squares for treatments (adjusted 
for block effects). The former is given by the elegant expression 


b 
Block 8.8. (adjusted) = (6) 


The latter can then be obtained using the well-known relation: 
Treatment 8.S. (Adj.) + Block 8.8. (Unadj.) 
= Treatment 8.8. (Unadj.) + Block 8.S. (adj.) (7) 
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The variance ratio F = E,/E, provides a test of significance of 
intra-block estimates of treatment effects. 

Nair [1944] had shown that for a non-resolvable incomplete block 
design (such as the singly linked blocks) the estimates of w and w’ 
are given by 


(8) 


Hence, estimate of » follows as 


E, E, 
A= OR, 


If E, turns out to be less than E, it is conventionally assumed that 


w = w’ or wn = 0. In such a case the unadjusted treatment means 
T,/r are compared as if the experiment was in randomized complete 
blocks. 

For calculating the lowest significant differences among the adjusted 
treatment means we have to calculate the variance of the difference 
between every pair of them. These pairs fall into groups, namely, 
those pairs which occurred together in the same block and those pairs 
which did not occur together in the same block. 

(a) Variance of the difference between pairs which have occurred 
together in the same block can be obtained by direct substitution in 
formula (39) of Nair’s [1952] paper. It simplifies to the form 


(b) Variance of the difference between two treatments which have 
not occurred together in the same block can be obtained by substitution 
in formula (40) of Nair’s [1952] paper. It simplifies to the form 


+m) (11) 


(c) The mean variance of differences for all pairs of treatments is 
derivable from formula (41) of Nair’s [1952] paper and simplifies to 


the form 


By substituting » = 1/6 in (10), (11) and (12) we obtain the cor- 
responding variances for intra-block estimates of treatment effects. 
It is interesting to note that the simplified analysis for Triangular 
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Singly Linked Blocks given by Nair [1953] can be deduced as a special 
case of the above by substituting 


v = 3p(p — 1), r= 2, b=p 
ILLUSTRATIVE EXAMPLE 


The data for this example have been taken from pp. 28-29 of Bose, 
Clatworthy and Shrikhande [1954]. They consist of the yield of 35 
varieties of oats in an experiment conducted in 15 blocks of 7 plots 
each arranged according to the singly linked block design. The values 
of the various parameters of the design are:— 


v = 35, k=7, r=8, b 
=1 = 0 
nm, = 18 Nn, = 16 


= Pic = 
8 8 9 6 


Table 2A, giving the field plan, shows the block number (j) in the 
first column. In the second column opposite each block number (and 
in the same row) are given the treatments (7) appearing in this block, 
and below each treatment number 7 is shown the corresponding yield. 
Adding all the yields of all the plots in the block we get B,;). This is 
the topmost figure in the cell of column (8) relating to each block. 
Summing up the block totals B,;) we get the grand total G of all the 
observations, namely 41763. The grand mean is G/vr = 397.743. 

Table 2B, giving the treatment allocation plan, shows the treatment 
number, 7, in the first column. In the second column opposite each 
treatment number (and in the same row) are given the blocks (j) in 
which this treatment appears, and below each block number (j) is 
shown the corresponding observed yield of the treatment. The top 
figure in each cell of column (3) gives the treatment total 7; obtained 
by adding the yields in the second row of the second column opposite 7. 

These values of 7’; are reproduced in the third row of each cell of 
the second column of Table 2A. Sum of the values of 7; for block (7) 
is denoted by >> ,;), (7';) and given below the block totals B,;, in column 
(3) of Table 2A. By subtracting this sum from r times B,;, we obtain, 
as. stated in equation (5), r times the value of Q,;, given in column (4) 


II 
or 


of Table 2A. 


Thus, for block (1) 
+216 
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TABLE 2A 


M1) (2) (3) (4) 
Block Treatments (i), yields and 7; Bi rQ (i) 
(7) Deli 
(35) (4) (18) (15) (1) (30) (17) 
389 502 512 314 442 404 428 2991 +216 
1294-1580 «1352 $18 1280 1088 1345 8757 
(2) (26) (11) (2) (31) (5) (19) (18) _ 
431 482 442 411 456 436 427 3085 + 23 
1229 «1494 9232 
(3) (34) (10) (19) (4) (8) (22) (21) - 
334 320 378 498 452 413 272 2667 + 69 
834 990 1159 1580 1391 1085 893 7932 
(4) (18) (9) (21) (3) (7) (25) (33) _ 
413 440 306 407 526 212 385 2689 —423 
1352 893 1357 1690 758 1041 8490 
~ (5) (8) (33) (23) (30) (14) (12) (26) _ 
443 286 265 328 417 280 326 2345 —667 
1391 1041 796 1088 1148 1009 1229 7702 
(6) (23) (22) (35) (5) (9) (6) (20) — 
271 286 455 504 526 515 314 2871 —120 
796 1085 1294 1536 1399 1513 #81110 8733 
; (7) (20) (16) (2) (33) (4) (28) (13) - 
366 491 556 370 580 284 434 3081 +518 
1110 1613 1345 1041 1580 795 1241 8725 
(8) (6) (21) (28) (12) (31) (15) (29) — 
406 315 285 365 380 260 365 2376 —276 
1513 893 795 1009 1117 818 1259 7404 
(9) (10) (27) (35) (25) (28) (11) (14) _ 
325 325 450 292 226 480 328 2426 —200 
990 999 1294 758 795 1494 11348 7478 
(10) (19) (32) (3) (1) (12) (20) (27) _ 
345 425 452 400 364 430 322 2738 — 42 
1159 1342 1357 1280 1009 #81110 999 8256 
(11) (17) (6) (32) (8) (25) (2) (24) _— 
431 592 510 496 254 347 275 2905 +158 
1345 1513 1342 1391 758 1345 863 8557 
(12) (7) (30) (29) (22) (13) (32) (11) _ 
642 356 476 386 376 407 532 3175 +326 
1690 1088 1259 1085 1241 1342 1494 9199 
(13) (3) (29) (5) (34) (16) (14) (17) —_ 
498 418 576 265 634 403 486 3280 +748 é 
1357 1259) 834 1613 1148 1345 9092 4 
(14) (13) (26) (34) (27) (24) (9) (15) _ 
431 472 235 352 276 433 244 2443 — 54 2 
1241 1229 834 999 863 1399 818 7383 HN 
qa) (16) (24) (10) (23) (7) 
438 488 312 345 260 522 2691 —276 
1280 1613 863 990 796 ©1690 8349 


G = 41763, G/or = 397.743 
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TABLE 2B 
TREATMENT ALLOCATION PLAN, INTRA-BLock EsTIMATES, AND 
ComsBiNnED INTRA- AND INTER-BLOCK [’sTIMATES 


(1) (2) (3) (4) Adjusted treatment 
mean with inter-block 
Treat- Blocks (3), Ti (r/b) > (3) information 
ment yield and and and 
i ri) (5) (6) 
Not 
recovered Recovered 
1 (1) (10) (15) _ -- 
442 400 438 1280 — 6.800 428 .933 
+216 —42 —276 —102 — 0.951 426 .984 
2 (2) (7) (11) _ _- 
442 556 347 1345 +46 .600 432.800 
+23 +518 +158 +699 + 6.517 446.161 
3 (4) (10) (13) 
407 452 498 1357 +18 .867 446 .044 
—423 —42 +748 +283 + 2.639 451 .454 
ql) (3) (7) 
502 498 580 1580 +53 .533 508 .822 % 
+216 +9 +518 +803 + 7.487 524.171 
5 (2) (6) (13) 
456 504 576 1536 +43 .400 497 .533 
+23 —120 +748 +651 + 6.070 509 .977 
6 (6) (8) 
515 406 592 1513 —15.867 509 .622 
-—120 —276 +158 —238 — 2.219 505 .073 
7 (4) (15) 
526 642 522 1690 —24.867 571.622 
—423 +326 —276 —373 — 3.478 564 .493 
8 (3) (5) (11) 
452 443 496 1391 —29 .333 473.444 
+69 —67 +158 -—440 — 4.103 465 .034 
9 (4) (6) (14) 
440 526 433 1399 —39.800 479 .600 
—423 -—120 —597 — 5.566 468 .189 
10 (3) (9) (15) _ = 
320 325 345 990 —27.133 339 .044 
+69 —200 —276 3.795 331.265 
482 480 332 1494 + 9.933 494.689 
+23 —200 +326 +149 + 1.389 497 .537 
12 (5) (8) (10) 
rf 280 365 364 1009 —65.667 358 .222 
i —276 —42 —985 — 9.184 339 .395 
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TABLE 2B—Continued 


(1) (2) (3) (4) Adjusted treatment 
mean with inter-block 
Treat- Blocks (j), Ti (r/b) Q3) information 
ment yield and and and 
Not 
recovered Recovered 
13 (7) (12) (14) 
434 376 431 1241 +52 .667 396.111 
+518 +326 —54 +790 + 7.366 411.211 
14 (5) (9) (13) 
417 328 403 1148 — 7.933 385.311 
—667 —200 +748 -119 — 1.110 383 .037 
15 (1) (8) (14) _ 
314 260 244 818 — 7.600 275.200 
+216 —276 —54 —114 — 1.063 273.021 
16 (7) (13) (15) 
491 634 488 1613 +66 .000 515.667 
+518 +748 —276 +990 + 9.231 534.590 
17 (1) (11) (13) 
428 431 486 1345 +74 .800 423.400 
+216 +158 +748 +1122 +10.461 444.846 
18 (1) (2) (4) _ 
512 427 413 1352 —12.267 454.756 
+216 +23 —423 —184 — 1.716 451.239 
19 (2) (3) (10) a cial 
436 378 345 1159 + 3.333 385.222 
+23 +69 —42 +50 + 0.466 386.178 
20 (6) (7) (10) — pee 
314 366 430 1110 +23 .733 362.089 
~—120 +518 —42 +356 + 3.319 368 .894 
21 (3) (4) (8) aad anes 
272 306 315 893 —42.000 311.667 
+69 —423 —276 —630 — 5.874 299 .625 
22 (3) (6) (12) 
413 286 386 1085 +18 .333 355.556 
+69 —120 +326 +275 + 2.564 360.812 
23 (5) (6) (15) 
265 271 260 76 —70 .867 288 .956 
—667 —120 —276 1063 9.911 268 .637 
24 (11) (14) (15) _— — 
275 276 312 863 —11.467 291.489 
+158 —54 —276 -172 — 1.604 288.201 
25 (4) (9) (11) _ _ 
212 292 254 758 —31.000 263 .000 
—423 —200 +158 —465 — 4.336 254.112 
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TABLE 2B—Continued 


(1) (2) (3) (4) Adjusted treatment 
mean with inter-block 
Treat- Blocks (3), Ti (r/b) Quy information 
ment yield and and and 
ter Not 
recovered Recovered 
26 (2) (5) (14) — = 
431 326 472 1229 —46 .533 425.178 
+23 —667 —54 —698 — 6.508 411.836 
27 (9) (10) (14) 
325 322 352 999 —19.733 339 .578 
—200 —42 —54 —296 — 2.760 333 .920 
28 (7) (8) (9) — = 
284 285 226. 795 + 2.800 264 .067 
+518 —276 —200 +42 + 0.392 264 .869 
29 (8) (12) (13) ~_ — 
365 476 418 1259 +53 .200 401.933 
—276 +326 +748 +798 + 7.440 417 .187 
30 (1) (5) (12) = = 
404 328 356 1088 — 8.333 365.444 
+216 —667 +326 —125 — 1.165 363 .055 
31 (2) (8) (15) 
411 380 326 1117 —35 .267 384 .089 
+23 —276 —276 —529 — 4.932 373.977 
32 (10) (Ti) (12) 
425 510 407 1342 +29 .467 437.511 : 
—42 +158 +326 +442 + 4.121 445.960 
33 (4) (5) (7) = = 
385 286 370 1041 —38 .133 359.711 
—423 —667 +518 —572 — 5.333 348.778 
34 (3) (13) (14) 
334 265 235 834 +50 .867 261.044 
+69 +748 —54 +763 + 7.114 275.629 
35 (1) (6) (9) 
389 455 450 1294 — 6.933 433 .644 
+216 —120 —200 —104 — 0.970 431.657 


The values of rQ,;, are reproduced in the third row of the cells of 
column (2) of Table 2B, below each block yield and against each treat- 
ment number, 7. Their sums, denoted by r >>;,, Q,;) are given below 
the values of 7; in column (3). By dividing these sums by 6 (in this 
case 15) we obtain the corrections, given in the upper entries in cells 
of column (4), to be applied to 7’; for obtaining the adjusted treatment 
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totals without recovery of inter-block information given in column (5). 
To obtain the adjusted treatment total with recovery of inter-block 
information, we have to multiply the second figure given against each 
treatment 7 in column (3) of Table 2B by the estimated value of u and 
subtract this quantity from 7; . 
To obtain the estimate of u, we have to complete the analysis of 
variance given in Table 3. 


TABLE 3 
ANALYSIS OF VARIANCE 


| Degrees | 
Source of variation | of | Sum of squares | Mean square 
freedom | | 
Blocks (adjusted) 14 41,028 .089 2930.578 (E,) 
Treatments (unadj.) 34 | 753 , 254.057 
Intra-block error. 56 | 141,159.911 2520.713 (E,) 
Total | 104 935,442,057 
Blocks (unadjusted) 182, 839.200 
Treatments (adjusted) 34 611, 442.946 17983.616 (E,) 


The total sum of squares, the block (unadjusted) and the treatment 
(unadjusted) sums of squares are calculated in the usual straightforward 
way. 

The block (adjusted) sum of squares is easily calculated by squaring 
the values given in the last column of Table 2A and using formula (6) 
in a slightly modified form, namely, 


1 2 2 2 
be = G5 [(216)* + (23)° + --- + 276)"] 


41028 .089 


The intra-block error sum of squares and the treatment (adjusted) 
sum of squares follow by subtraction. 

It will be noticed that the Block (adjusted) sum of squares and 
Treatment (adjusted) sum of squares given in Table 3 closely agree 
with those given in Tables 3.3 and 3.2 respectively on pages 38 
and 36 of Bose, Clatworthy and Shrikhande [1954]. The slightly bigger 
disagreement in the case of the former is due to a computational mistake 
in the treatment (unadjusted) sum of squares in their Table 3.3. 
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The intra-block variance ratio F = E,/E, = 7.134 is significant at 
the 0.1 per cent level. 
Estimate of yu is 


E, — E, 
A= 0.00932387. 

Each treatment total 7; given in column (3) of Table 2B may now 
be adjusted by subtracting from it either 1/b or uw times the value of 
r >. Q:;) given just below the value of 7’; , according as the adjustment 
desired is without or with recovery of inter-block information. The 
corresponding adjusted treatment means are obtained by dividing 
those totals by 3. They are given in columns (5) and (6) of Table 2B. 

(a) The variance of a difference between means, adjusted with 
recovery of inter-block information, of two treatments occurring in 
the same block is, using formula (10), 


2 Dal 


2(1.0186477 X 2520.713)/3 


1711.812 


(b) The variance of a difference between means, similarly adjusted, 
of two treatments not occurring together in the same block is, using 
formula (11), 


2 (1 + ru) = 2(1.0279716 x 2520.713)/3 


1727.481. 


(c) The mean variance of differences for all pairs of adjusted treat- 
ment means is, using formula (12), 


2 rv — k) -2( 42 ) 
2/14 (142, 
= 1719.186. 


To obtain values of the corresponding variances for differences 
between treatment means adjusted without recovery of inter-block 
information, we have only to replace u by 1/b = 1/15 in (a), (b) and (c). 
The values of these variances are 1904.539, 2016.570 and 1957.260 
respectively. 

The 1.s.d. values may now be calculated in the usual way. 
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A RANKIT ANALYSIS OF PAIRED COMPARISONS 
FOR MEASURING THE EFFECT OF SPRAYS ON FLAVOR 


C. I. 
The Connecticut Agricultural Experiment Station and Yale University, New Haven 
AND 
Mary L. GREBNWOOD AND Epna SAKAMOTO WHITE 


School of Home Economics and Storrs Agricultural Experiment Station 
University of Connecticut, Storrs, Connecticut, U.S.A. 


Experiments on consumer preference have taken many forms. In 
that known as “paired comparisons”, the several treatments or stimuli 
are compared in pairs, each ‘treatment appearing with every other 
treatment in the same pair. It is an especially appropriate design 
for testing the effect upon a food of chemically different pesticides. 
These may produce qualitative differences in flavor which the subject 
finds easier to compare in pairs. Usually he is asked to report a simple 
preference for one of the two samples in each pair. Scheffé [1952] has 
extended this procedure by asking each subject to note the degree of 
his preference within each pair as well as its direction. 

A recent experiment of this type concerned the relative palatability 
of apples sprayed with two different insecticides and with two different 
fungicides in a 2 X 2 factorial design. In its analysis, several statistical 
methods were to be compared in respect to their sensitivity, consistency 
and computational requirements. Among them were the Mosteller 
[195la,b] modification of the original proposal by Thurstone, the 
Bradley-Terry [1952a,b; 1953] analysis, and the Scheffé [1952] tech- 
nique. In the course of the study, a new approach was developed which 
shows particular promise. It is based upon the mean normal deviate, 
tabled by Fisher and Yates [1953] and called the “rankit” by Ipsen 
and Jerne [1944]. It answered several questions which we could not 
otherwise resolve so readily, if at all. We will describe this rankit 
analysis in its present application and compare it with alternative 
procedures. 

The experiment had several objectives. One was to arrange the 
treatments on a linear scale, spaced so as to reflect the average degree 
of preference expressed by the tasters. Criteria were needed for judging 
the significance on this scale of effects associated with factorial combi- 
nations of the individual treatments. The scale had to be validated by a 
test of its additivity or subtractivity. If, for example, B was preferred to 
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A and C to B, would C be preferred to 4? Did the replicates of the 
experiment agree? 

Also, there was no assurance a priori that the tasters would agree. 
Some might judge more consistently than others in replicated tests. 
Or, though consistent, they might differ in their response to specific 
factorial comparisons. Both kinds of discrepancy could be important 
to the experimenter. The order of presentation of the two treatments 
comprising a pair, or of a given pair in a sequence of pairs may have 
been a factor. The samples tasted first, for example, might modify the 
subject’s reaction to the following samples, even though the pesticide 
affected flavor only indirectly, by influencing the content of sugars, 
acids or other constituents in the fruit (Garman ef al. [1953]). Informa- 
tion regarding these possibilities was therefore sought. Finally, an 
analysis with good discriminatory power which required a minimum 
of specialized tables or of involved statistical calculations would be 
preferred. The proposed rankit analyses will be examined with these 
objectives in mind. 


EXPERIMENTAL PROCEDURE 


Samples and their preparation 


The test samples were randomly-selected Cortland apples from the 
fall harvest in an experimental orchard of The Connecticut Agricultural 
Experiment Station at New Haven in 1951 (Garman ef al. [1953]). 
They had been subjected to four different spray treatments involving 
two insecticides, lead arsenate and parathion, and two fungicides, 
thiram and sulfur, in a 2 X 2 factorial design. Earlier work (Greenwood 
et al, [1951]) had demonstrated that unsweetened apple sauce was a 
better medium than raw apples for the detection of flavor differences. 
Not only is sauce more homogeneous, but it can be made more repre- 
sentative by preparing each sample from portions of several apples. 
In order to avoid any changes which might take place in raw apples 
during storage, sauce sufficient for all of the taste tests was made on a 
single day. 

In preparing the sauce, ten apples from each treatment were selected 
for each test day. After these had been washed in detergent suds, 
rinsed and wiped, they were quartered, cored, and weighed in 300 gram 
portions. Each such portion required about one-fourth of each apple 
and made a sample of sauce sufficient for 30 to 35 tasters. After cutting 
into 16th’s, the apple segments comprising each sample were added 
to 150 grams of boiling tap water, covered, and boiled gently for 15 
minutes, with one stirring after 10 minutes. After cooking, each sample 
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was weighed, sieved through a Foley mill with 50 turns of the handle, 
cooled, put into a standard waxed container, and frozen. Before each 
day’s taste test, the required samples of apple sauce were thawed and 
allowed to come to room temperature. 


Test procedure 


On each test day, samples representing the four different combina- 
tions of spray components were offered to the tasters in six pairs, each 
treatment with every other treatment. The sample pairs were identified 
by Roman numerals and tasted in the order indicated on the score 
sheet (Table 1). For each pair, the taster was asked: (1) to give his 


TABLE 1 
INDIVIDUAL ScorE SHEET FOR Eacu Taste TEst 
Instructions: Do not discuss your reaction with other tasters. Compare sample A 
with B in each pair and state preference and degree of preference. (Always make a 
choice.) The first time taste sample A and then sample B, but you may go back 
and re-taste in any order. Start with Pair I and work toward the last pair. 


Pair I II III VI 


I prefer (A or B) 


The difference is: 
(check one) slight 


moderate 


large 


no difference 


Comments: 
University of Connecticut Date: 
School of Home Economics Test No: 


preference for sample A or B, in every case making a choice between 
them, and (2) to check whether he considered the difference slight, 
moderate, large or, if his initial choice had been arbitrary, non-existent. 
The sample pairs which corresponded to the Roman numerals on a 
given score sheet were determined for each taster by the sequence in a 
row of the 6 X 6 Latin square in Table 2. The treatment designations 
beneath Table 2 follow the customary nomenclature for a 2 X 2 factorial 
design. Only one insecticide, lead arsenate (L), and one fungicide, 
thiram (Th), are represented by letters; when either was absent, it was 
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TABLE 2 
LATIN SQUARE FOR ASSIGNING PaiRs OF SAMPLES 
(Designated on the score sheets as A and B respectively) to successive subjects on 
the first, third and fifth test day, to insure that all orders of tasting were represented 
on each occasion. On the second, fourth and sixth test day, every subject received the 
same sequence of sample pairs as on the preceding test day, but in the reverse order. 


Order of tasting pairs in sequence 
Sequence 
I II III IV V VI 

1 LTh:Th L:Th (—):Th LTh:L (—):LTh L:(-) 
2 L:Th LTh:Th Li(—) (—):LTh (—):Th LTh:L 

3 (—):Th L:(—) L:Th LTh:TH LTh:L (—):LTh 
4 LTh:L (—):Th (—):LTh L:Th L:(—) LTh:Th 
5 (—):LTh LTh:L LTh:Th L:Th (—):Th 
6 (—):LTh LTh:L (—):Th LTh:Th L:Th 


Spray TREATMENTS 


LTh = lead arsenate and thiram 
L = lead arsenate and sulfur 
Th = parathion and thiram 
(-—) = parathion and sulfur 


replaced in the spray mixture by its alternate, parathion or sulfur. 

The tasters, including both men and women, were students and staff 
members at the University of Connecticut. On any one day they 
numbered from 31 to 34, but of these only 25 were present on all six 
test days. The present paper is based on the data from these 25 tasters. 
Since they were not asked to assess an attribute but merely to state 
an overall, subjective preference, they were given no preliminary 
training. Some, however, had participated in other tests of a similar 
nature. 


Sample sequence 


A primary objective of the Latin square in Table 2 was to insure 
that every pair of samples would occur in each position in the sequence 
of six pairs. The six replicate tests were arranged in three pairs. On 
the first test day, the tasters were assigned in rotation on order of arrival 
to the successive rows in the Latin square. Two days later, the test 
was repeated and each subject received his pairs of samples in exactly 
the same sequence as in the preceding test, but with the order within 
each pair reversed. A pair consisting of L:Th on the first day, for ex- 
ample, was presented in the order Th:L on the second day. On the 
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third and again on the fifth test days, the tasters were reassigned to 
the sequences 1 to 6 in the order of their arrival. 

By virtue of this design, the order of tasting within each pair was 
fully balanced and its effect will be tested in later sections. This was 
not true, however, between pairs. Since the 25 tasters did not arrive in 
any planned order, the sample sequences were not balanced among them. 
In consequence, the number of repetitions of each sequence varied from 
20 to 30 and the number of different tasters represented in each sequence 
from 8 to 13, with 10 tasters repeating a sequence and one taster follow- 
ing the same sequence (No. 3) in all six tests. The incomplete balance 
among sequences was a potential source of bias if the preference asso- 
ciated with a given insecticide or fungicide were to increase or diminish 
during the tasting of six paired samples of apple sauce. 

To test this possibility, the percentage preference for lead arsenate 
(L) over parathion was determined for each position (J to VJ) in the 
order of tasting when both were paired with the same fungicide. Similar 
percentage preferences were determined for the thiram (Th) over sulfur, 
when both were paired with the same insecticide. These preferences 
are given separately for each sequence in Table 3. The percentages 


TABLE 3 
DISTRIBUTION OF TasTERS, TRIALS AND PERCENTAGE PREFERENCES 
AMONG THE SEQUENCES IN TABLE 2 


Preferences for lead arsenate (L) and for thiram (Th) restricted to pairs with the 
same fungicide or insecticide, respectively. 


No. of No. of Percentage preference for L or Th 
Sequence | tasters tests in specified pair 

I II Ill IV Vv VI 
1 12 30 53.3* 60.0 73.3 63 .3* 
2 11 28 50.0* 39.3* 60.7 57.1 
3 8 20 45.0 40.0* 50.0* 65.0 
4 10 22 50.0 40.9 54.5* 63.6* 
5 13 26 61.5 50.0* 57.7* 65.4 
6 9 24 54.2* 70.8 45.8 58.3* 


*Lead arsenate (L) compared with parathion in a pair with the same fungicide; in the remaining 
percentages, thiram (7) was compared with sulfur in a pair with the same insecticide. 


within each row represented the same group of tasters and hence were 
comparable by pairs, without confounding their positions in a sequence 
of tests with differences between tasters. Although the preference both 
for lead arsenate and for thiram tended to increase with order of tasting 
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I to VI, the trend within tasters was not significant for either spray 
component (6 = 1.93 + 1.14 for lead arsenate and 6 = 1.11 + 1.87 
for thiram). 

The effect of order of tasting was tested more critically by computing 
an analysis of covariance of degree of preference y upon the linear term 
for position in the sequence of pairs. The six successive paired tests 
were assigned covariate values of vy = —5, —3, —1, 1, 3 and 5 respec- 
tively for computing sums of squares and products paralleling those of 
the rankits in Table 12. The regression of y upon v in the error (row 14) 
was not significant (F = 0.22) and, when F values were computed from 
the “reduced”? mean squares, all of those in the preceding rows were 
substantially unchanged. Since the position of a pair had no demon- 
strable effect in either test upon the resulting preference, differences 
in sequence have been ignored in all later analyses. 


THE RANKIT ANALYSIS OF SIMPLE PREFERENCES 


The rankit analysis of simple preferences follows essentially the same 
model as Mosteller’s [1951a] development of the Thurstone approach. 
Each stimulus, represented here by the apple sauce from one of the 
spray treatments, gives rise in the subject to a taste sensation, which 
can be located on a subjective scale. In replicated tests on the same 
subject, the sensations from each stimulus are assumed to be normally 
distributed. During the test, stimuli or samples are presented in pairs, 
each sample giving rise to a sensation. These sensations the subject 
compares and reports one as the greater, no ties being allowed. If the 
paired sensations are correlated, the correlation is assumed to be equal 
for all pairs. Just as the sensations in replicated tests on the same 
subject may be considered as varying normally, the same postulate 
of normality is adopted for the sensations in different subjects. 

Although the stimuli may have no measurable physical character- 
istic, they can be spaced on a linear scale of normal deviates by means 
of the responses. In the present experiment, each of the four treat- 
ments, representing the 2 X 2 factorial combinations of two insecticides 
and two fungicides, occurred in three pairs. The proportion of choices 
in which each was preferred to the other member of the same pair 
could be transformed to a normal deviate and the three deviates for 
each sample averaged, with 4 as the divisor. These averages or scores 
would measure the spacing on the hypothetical preference scale of the 
four separate treatments or stimuli. Mosteller [1951b] has described a 
x’ test for determining whether such a scale can be considered linear. 
One could compute an expected difference in normal deviates for each 
of the six paired comparisons, convert these differences to proportions 
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and the proportions to equivalent angles. The corresponding observed 
proportion for each pair would then be converted to its equivalent 
angle, and the observed and expected angles compared by x’. 

In a replicated experiment, the above process can be simplified 
by means of rankits, tabled as “scores for ordinal (or ranked) data” 
by Fisher and Yates [1953]. Each rankit is the expected mean deviate 
of the 1st to Nth item in a ranked sample of N items from a normal 
population which has a mean of 0 and a standard deviation of 1. With 
rankits, the number of times a given treatment is preferred, when 
presented with every other treatment, can be converted directly to 
mean deviates. Non-additivity in the rankits may be tested by the F 
ratio from an analysis of variance, its error term being the interaction 
of non-additivity by replicates. We will assume here that the rankits 
for different stimuli or treatments differ so little, if at all, in their inherent 


precision, that negligible information is lost by weighting all values 
equally. 


Analysis with the replicate as the unit 


In the present experiment, the same 25 tasters judged all six pairs 
of samples on each of six test days. Consequently analyzing the data 
with the test day or replicate as the unit, differences between tasters 


TABLE 4 


FREQUENCY WITH WHICH APPLESAUCE FROM EacuH Spray TREATMENT 
Was PREFERRED IN EacH PatRED COMPARISON BY 25 TASTERS 
In odd-numbered tests, the two samples of each pair were tasted in the order 
shown in Table 2 (order a); in the even-numbered tests this order was reversed 
(order b). The chi-square values for the consistency of preferences have been 
computed as described by Mosteller (1951b]. 


Frequency of preference for Test of 
Test Date Order each treatment in each paired comparison additivity 
1951 in pairs 

Th:LTh| Th:L | Th:i(—) | LTA:L |LTh:(—)| x? F 
1 11/13 a 11 14} 12 13] 13 12] 14 11) 16 9{10 1.422 .71 
2 11/15 b 10 15/17 10|17 14 11/]0.918 .86 
3 11/27 a 14 11/11 14/13 12/16 12] 3.092 .38 
4 11/29 b 9 16/15 10/13 9/18 15]1.217 .76 
5 12/4 a 11 14/15 10] 14 11/16 12] 12 13] 0.527 .92 
6 12/6 b 14 11/11 14) 13 12]16 6) 6.243 .10 
Total 69 81/81 69] 81 69/95 55 |103 47] 78 72 | 2.649 .44 

2 


Treatments: LTh = lead arsenate and thiram, L = lead arsenate and sulfur, Th = parathion and 
thiram, (—) = parathion and sulfur. 
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neither biased the ratings nor entered into the analysis. Table 4 
records the number of times that each apple sauce was preferred when 
presented in the pair listed at the top of the table. In tests 1, 3 and 5, 
the two samples forming each pair were tasted in the order shown in 
Table 2 (order a). Two days later, each subject repeated the test 
(Tests 2, 4 and 6) with the same sequence of pairs as in his preceding 
trial, but with the order of the two samples, within each pair, reversed 
(order b). The six replicates, therefore, formed three pairs of tests, 
which determined, in part, the structure of the analysis. 

As a first step, the frequencies of preference in Table 4 were trans- 
formed directly to rankits. With 25 tasters, the number preferring 
the first sample in a given pair could vary from 0 to 25, so that rankits 
were required for a series of 26. Since the actual number of such choices 
in the present experiment ranged only from 9 to 20, the corresponding 
rankits in Table 5 (left) are restricted to the 10th to the 21st item. As 


TABLE 5 


MEAN Norma DeviaTEs (7) OR RANKITS FOR 
SERIES OF 26 AND OF SEVEN ITEMS 


(for transforming the data in Tables 4, 8 and 10) 


For 26 alternatives For 7 alternatives 
Frequency Frequency Frequency Degree 
of Rankit, of Rankit, of of Rankit, 
preference y preference y preference preference y 
9 — .34 15 24 0 —3 —1.35 
10 — .24 16 34 1 —2 — .76 
11 —.14 17 44 2 —1 — .35 
12 — .05 18 55 3 0 0 
13 .05 19 67 4 1 .35 
14 14 20 79 5 2 .76 
6 3 1.35 


these cover only the middle of the rankit scale, a direct analysis of the 
number of preferences, without transformation, would here have given 
substantially the same results. The second sample in each pair had 
numerically the same rankit as the first sample, but with the sign 
reversed, so that only the rankits for the first sample were required. 
The transformed values from Table 4 are given in the body of Table 6, 


each referring to the first of the two paired samples listed at the top 
of its column. 
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TABLE 6 


RANKITS FOR THE REPLICATED TESTS IN TABLE 4 AND 
TuHerr FactortaL CoMBINATIONS 


Test Date, Order Rankit y for each pairing Yer for effect of 
1951 in pairs |ThA:LTh Th:L Th:(—) LTh:L LTh:(—) Th L ThXL 
4 
4 
1 11/13 a —-.14 —.05 .05 .14 —.24 .48 -29 47 
% 2 11/15 b —.24 44 .24 .67 14 1.79 -61 .30 
3 11/27 a 14 —.14 .05 .34 .44 -05 -69 -49 -10 
4d 11/29 b — .34 .24 -05 .34 55 —.24 1.18 .41 .87 
5 12/4 a —.14 .24 .14 .34 05 —.05 77 —.10 .39 
6 12/6 b 14 -—.14 .05 34 .79 .67 1.04 1.46 —.52 
Total, Ti — .58 .59 58 1.94 2.84 .33 5.95 3.16 1.61 
— Di -30 —.49 —.10 —.30 —1.18 —.81 |—2.07 —1.80 
Coefficients Th 0 1 1 1 1 0 
z for L -1 -1 0 0 1 1 
effect of ThXL -1 0 -1 4 0 -1 


The mean rankit or score for each of the four samples or treatments 
was determined from the sums (7’,) of the rankits for each pair in which 
it occurred, adding those from the columns in Table 6 where the treat- 
ment was listed first and subtracting those where it appeared second in 
the pair. The total for treatment Th, for example, was —.58 + .59 + 
.58 = .59, and for LTh, L and (—) it was 5.36, —2.20, and —3.75 
respectively. The following mean rankits from these totals, 7 = 
> y/(4X6), representing the spacing of the four treatments in mean 
normal deviates, closely paralleled those obtained by applying the 


Treatment Th LTh L (-—) 
Mean rankit, 7 .0246 .2233 | —.0917 | —.1562 
Mean deviate, S’ .0256 .2362 | —.0967 | —.1651 


Thurstone-Mosteller technique separately to each replicate and averag- 
ing the values of S{. Their relative magnitudes corresponded almost 
exactly (Table 14). 

The treatments in the present series formed a 2 X 2 factorial, 
as indicated by the symbols in Table 2, giving contrasts involving 
thiram v. sulfur, lead arsenate v. parathion, and the influence of each 
of these on the effect of the other, i.e., their interaction. These three 
contrasts have been designated as the effect of Th, of L, and of Th X L, 
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respectively. To compute their numerical values direetly, the pair 
totals 7’, were multiplied by the corresponding coefficients 7 at the 
bottom of Table 6 and the products summed. The totals of 5.95 for 


TABLE 7 
ANALYsts OF VARIANCE OF THE Reriicatep Test RANKITS IN TABLE 6 


Sum of Mean 
Row Source 1).f. | squares square F 

1 Effect of Th 1 1.4751 1.4751 33. 16°* 
2 1 -4161 9.35* 
3 * FRX 1 . LO80 . 1080 2.43 
4 Average non-additivity 3 . 1607 0536 1.20 
5 Order X Th I .1785 .1785 4.01 
6 1 . 1350 1350 3.03 
7 1 -0040 -0040 .09 
8 non-additivity 3 .0319 -72 
9 Test X Th 4 .0908 .0227 -53 

10 4 2004 -0501 1.17 

ll xa, 4 .0658 1.54 

12 “ non-additivity 12 .5135 .0428 1.00 

13 Total 36 3.6408 

14 Test interactions 24 1.0678 -04449 1.00 


< 001, =P < 0601. 


the direct effect. of Th, of 3.16 for the direct effect of L, and of 1.61 for 
the interaction Th X L represented the three degrees of freedom among 
the four treatments. 

The analysis of variance in Table 7 has been computed directly 
from the data in Table 6. The treatment effects in rows 1 to 3, computed 
as (eT) /4k with 4 = and = 6 replicates, accounted for 
three of the six degrees of freedom among the six pair totals 7; . The 
other three degrees of freedom measured non-additivity in the series, 
or the failure of our observations to conform exactly to a linear order 
on the seale of rankits. Its sum of squares (row 4+) was computed from 


the totals 7, and the three preceding sums of squares as (}> T3/k) — 


(OY? 

Beeause the two samples within each pair were always presented 
in the a series in the order shown in ‘Table 2, and in the 6 series in the 
reverse order, we could measure the effect of reversing the. order of 
tasting upon the tasters’ preferences. The three rankits for order b 
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were subtracted from the three rankits for order a to obtain the differ- 
ence D; for each treatment pair (Table 6). Substituting D, for T; 
in the formulae of the preceding paragraph gave the sums of squares 
in rows 5 to 8 of Table 7. 

Although the remaining variation among the y’s in Table 6 might 
have been pooled at once into an experimental error, its homogeneity 
could be tested by first isolating the “interaction” i.e. discrepancy of 
each factorial effect by replicates, i.e. tests. Their respective sums 
of squares in Table 7 were computed in turn from the last three 
columns of zy in Table 6, as zy)*/4) — — 
(>-?(xD,)/4k). The sums of squares in rows 1 to 11 were subtracted 
from the total, >> y’, to obtain the remainder, that for the “interaction” 
of tests and non-additivity. With its mean square (row 12) as a pro- 
visional error, the tests clearly agreed in their rating of the three effects 
of treatment; in consequence, all interactions of tests by treatment 
were combined (row 14) into a single error mean square with 24 degrees 
of freedom. 

The variance ratios F showed a highly significant preference for 
thiram over sulfur and a smaller but still very significant preference for 
lead arsenate over parathion, with no significant interaction between 
the fungicide and the insecticide. Although the response to both direct 
effects may have been influenced in part by the order of presentation, 
neither interaction mean square was significant (P > 0.05). Finally, 
the F values in rows 4 and 8 gave no reason to question a linear relation 
among treatments on the rankit scale, as postulated in setting up the 
analysis. This agreed with the x’ test for non-additivity on the scale 
of norma] deviates (Mosteller [1951b]) in the last two columns of Table 4. 


Rankit analysis with the taster as the unit 


Since every subject appeared once in each of the six replicates, the 
data from the 25 subjects could be pooled and the agreement among 
replicates compared. To pool replicates and examine the agreement 
among subjects was equally legitimate. The reactions of each taster 
to each treatment pair were therefore totalled over all six replicates, 
giving for Th:LTh, Th:L etc. for taster 1, for example, frequencies of 
preference 3:3, 4:2, 4:2, 3:3, 6:0, and 1:5, leading over all 25 tasters 
to exactly the same treatment totals as in Table 4. The frequency of 
preference of each taster for the first of the two items in each paired 
comparison was then converted directly to its rankit in a series of seven 
(Table 5, right). The rankits for the above preferences of taster 1, for 
example, were y = 0, .35, .35, 0, 1.35, —.76, respectively, as shown in 
Table‘s. 


3 
4 
| 
4 
x 
> 


392 BIOMETRICS, DECEMBER 1956 


TABLE 8 
RANKITs FOR INDIVIDUAL TASTERS 


Rankits, from Table 5 for a series of seven, for the frequency of preferences by each 
taster in specified comparisons. 


‘Taster Rankit y for the treatment listed first in each paired comparison 
Th:LTh Th:L Thi(—) LTh:L 

1 0 35 .35 0 1.35 — .76 
2 — .76 —1.35 —1.35 0 .76 .35 
3 — .76 0 35 — .35 .76 — .76 
4 76 .76 0 1.35 0 0 
5 0 0 0 1.35 1.35 0 
6 -76 0 0 0 — .35 0 
7 0 .76 .35 76 35 .35 
8 .35 — .76 0 0 .76 .35 
9 0 .76 35 0 0 — .76 
10 — .35 0 0 — .35 0 0 
11 — .35 .35 0 — .35 0 — .35 
12 — .76 0 35 76 1.35 — .35 
13 0 0 .76 -76 35 
14 0 — .35 35 .35 
15 — .35 35 — .35 0 .76 — .35 
16 0 35 35 1.35 35 35 
17 — .35 — .35 — .35 — .35 0 0 
18 — .35 35 — .35 1.35 76 0 
19 35 35 — .35 .76 0 76 
20 — .35 — .76 0 0 — .35 76 
21 0 35 76 -76 1.35 76 
22 0 — .35 0 — .76 — .35 35 
23 0 0 35 0 76 — .76 
24 35 35 0 35 0 1.35 
25 — .35 35 1.35 76 76 — .76 
T; —2.16 1.86 2.16 8.50 11.48 1.23 


The analysis of variance in Table 9 was computed from Table 8. 
For each of the three factorial contrasts, the sums of the rankits (7';) 
for the 25 tasters were multiplied by the coefficients x at the bottom of 
Table 6, and the contrast sum of squares computed as in Table 7 but 
with k = 25. The residual variation among the 7',;’s gave the average 
non-additivity in row 4. To measure the agreement between subjects, 
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> zy for each treatment factor, Th, L and Th X L, was determined 
for each individual (as in the last three columns of Table 6), leading by 
a similar calculation to the sums of squares in rows 5 to 7 of Table 9. 
The remainder (row 8) measured the “interaction” of tasters and non- 
additivity, which served as the error. 


TABLE 9 
ANALYSIS OF VARIANCE OF THE RANKITS FOR INDIVIDUAL TASTERS IN TABLE 8 
Sum of Mean 
Row Source Df. squares square F 
1 Th 1 5.7600 10.64** 
2 L EN 1.6926 7.81° 
3 1 .5285 2.44 
4 Average non-additivity 3 . 7526 .2509 | 1.16 
5 Tasters X Th 24 12.9956 .5415 | 2.49** 1.00 
6 XL 24 6.6400 .2767 | 1.28 
7 24 4.5116 . 1880 .87 
8 X non-additivity | 72 15.6100 .2168 | 1.00 
9 Total 150 48.4909 
*P < 0.01, < 0.008. 


From the F values in Table 9, it is concluded that the tasters reacted 
quite similarly to the comparison of lead arsenate with parathion, and 
even more similarly to the interaction of insecticide by fungicide 
(Th X L). Their responses to thiram vs. sulfur,(Th), however, differed 
significantly. Over all tasters, neither nori-additivity nor the inter- 
action Th X L was significant. Though smaller than in Table 7, the 
F value for lead arsenate was clearly significant. The average effect of 
thiram, when tested against the interaction of tasters by thiram, was 
also smaller but still significant. 

A common problem in taste tests is the internal consistency of the 
subjects. Individuals whose scores are not additive presumably are 
poor subjects, and greater sensitivity should result if they could be 
eliminated. For each subject, an adaptation of Mosteller’s x’ test 
for non-additivity with 3 degrees of freedom has been computed from 
the observed and expected angle for each of the six paired comparisons. 
Of the 25 tasters, only two had a significant x’ at P = .05, about the 
number which would be expected by chance. When totalled, >> x? = 
85.50 with 75 degrees of freedom (P = 0.2). So far as could be judged 
from x’, the tasters in this experiment did not differ significantly in the 
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consistency of their preferences, but with only six replicates for each 
taster, the test may have been too small to identify discrepant indi- 
viduals. 


A RANKIT ANALYSIS FOR DEGREE OF PREFERENCE 


Scheffé [1952] has extended the method of paired comparisons by 
asking the subject to report not only which sample he prefers in each 
pair, but also his degree of preference. This extension was included 
in the present experiment (Table 1). For analysis, Scheffé ranked each 
preference in his example from —3 to +3, but noted from the distri- 
bution of these ranks that scores would be preferred which reduced 
the difference between 0 and 1 and increased that between 2 and 3. 
In line with this suggestion, ranks here were transformed to the rankits 
for a series of seven (Table 5). The calculation of the analysis of vari- 
ance as given below has been simplified and extended from that reported 
by Scheffé. 


Analysis of variance 


The original data, of one rankit for each single-pair judgement, 
formed a block of 900 responses, representing the six pairs of samples, 
six replicate tests and 25 tasters. The frequency distribution of the 
rankit response for the 25 tasters in each row of Table 10 represents one 
replicate of each treatment pair. The dip in the total frequencies at 0 
preference is similar to that noted by Scheffé in his experiment, but the 
small frequencies at the ends of the scale suggest that seven choices 
were about right. The sums of squares in rows 1 to 9 of Table 12 were 
computed from the >> fy in the last column of Table 10 and in the last 
two rows of Table 11, by equations similar to those used for Table 7. 
In each case, the divisor was the number of rankits y entering into the 
number that was squared, or 600 for each treatment effect and 150 for 
non-additivity. 

For computing the variation among tasters, the six responses y 
of each taster to each pair of stimuli were totalled to obtain the >> y in 
Tabie 11. The >> y in each row were then multiplied, in turn, by the 
factorial coefficients x (Table 6) for each treatment effect, and from 
these sums )> (x > y) their “interactions” with tasters in Table 12 
were computed. The “interaction” of tasters and non-additivity was 
determined as a difference, starting with the 150 values of >> y in 
Table 11. The frequency distribution of the y in the last line of Table 10 
gave the total sum of squares with 900 degrees of freedom and the error 
sum of squares as a remainder (Table 12). 
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TABLE 10 
NuMBER OF Tasters (f) with Each DEGREE OF PREFERENCE y (IN RANKITS) 


FOR THE First OF THE Two LisTED TREATMENTS 
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TABLE 11 
Torat Grapep Resronse or Eacu Taster Rankits (>, y/) 
FOR THE First oF Parr oF TREATMENTS IN Six TESTS 
The error sums of squares [y*], each with 30 degrees of freedom, were computed 
from the individual y for each taster. 
> y for specified pair comparison 
‘aster [y?] 
Th:LTh Th:L Th:(—) LTh:L LTh:i(—) L:x(—) |n = 30 
1 —1.11 .70 1.05 — .70 4.37 
2 —3.04 —3.74 —3.33 — .82 2.63 1.93 7.27 
3 —1.46 41 1.52 — .70 1.81 —1.87 5.37 
; 4 76 1.75 .06 1.40 .06 0 3.37 
: 5 — .76 —1.52 41 2.63 2.98 — .76 6.23 
6 3.04 — .59 .35 35 —1.46 .82 10.67 
Ff — .41 1.05 1.46 1.40 1.05 .76 4.78 
8 1.29 —2.63 —1.00 — .41 3.81 — .06 16.60 
9 0 2.81 1.52 0 —1.17 -1.11 11.80 
10 — .70 0 0 —1.11 0 0 4.58 
; 11 — .70 .29 41 — .70 0 —1.11 5.36 
12 —1.75 — .4l 70 1.40 2.10 -1.11 4.22 
13 — .35 .35 0 1.40 1.05 70 1.94 
14 .06 — .20 .76 .70 .70 5.51 
15 — .70 .70 — .70 0 1.40 — .70 3.76 
16 — .06 2.05 2.05 2.16 76 Bel 9.02 
B 17 — .70 — .70 — .7 — .70 0 0 4.08 
18 —1.93 1.29 — .52 2.22 1.05 — .76 10.04 
19 1.52 1.87 — .70 1.81 41 2.22 7.30 
+ 20 —2.05 —2.40 — .35 — .35 — .29 2.22 6.62 
21 | -1.17 70 43.05 404 5.92 2.22 | 7.67 
22 0 — .35 — .35 —1.40 — .70 35 3.33 
23 — .18 — .41 .70 1.41 4.22 —1.99 14.75 
24 a-11 1.93 0 .76 .35 3.33 7.75 
25 | ° —1.93 2.17 3.16 2.98 3.22 .06 7.96 
Sum | —11.22 5.44 9.61 19.17 30.95 6.66 | 174.35 
Diff.* 3.74 —5.42 — .35 —7.59 -8.71 -—11.34 


*From the last column of Table 10, subtracting the > fy for the even-numbered tests from those 
for the odd-numbered tests; used in computing rows 5 to 8 of Table 12. 
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TABLE 12 
ANALYSIS OF VARIANCE FOR DEGREE OF PREFERENCE IN RANKITS, 
: CoMPUTED FROM THE Data IN TABLES 10 AND 11 
The F values in rows 1 and 2 have been determined with the mean squares in 


rows 10 and 11 respectively. The values of F in the last two columns are from a 
comparable analysis in ranks, without transformation to rankits. 


Sum of | Mean F from 
Row Source D.f. | squares | square F ranks 
1 |Treatment Th 1 | 7.0785 |7.0785 9.16** 9.32** 
2 L 1 3.1378 |3.1378 6.07* 6.12* 
3 ThxL\ 1 .3323 | .3323 | 1.43 1.65 
4 |Non-additivity 3 .2352 | .0784 | .34 .31 
5 |Order X Th 1 .8118 | .8118 | 3.50 3.85 
xe 1 .5624 | .5624 | 2.42 2.28 
1 .0002 | .0002 .00 .00 
8 “ 
X non-additivity; 3 .6626 | .2209 | .95 1.03 
9 |Other test 
interactions} 24 | 5.2201 | .2175 94 .93 
10 |Tasters X Th 24 | 18.5464 | .7728 | 3.33°° 1.00 || 3.09°° 1.00 
1l oe xb 24 | 12.4104 | .5171 | 2.23° 1.00 2.16** 1.00 
12 “ xX Thx Li 24) 4.9044] .2044| .88 .98 
13 
X non-additivity} 72 | 16.5759 | .2302] .99 ..95 
14 |Remainder or error} 720 |167.1007 | .2321 | 1.00 1.00 
15 Total 900 |237 .5787 


*P < 0.025, *P<0.01, °P <0.005, °P <0.001. 


The single Table 12 includes all of the comparisons previously 
divided between Tables 7 and 9, where the extent of the preference 
was not measured. Each F for non-additivity (rows 4, 8 and 13) was 
now less than 1. Those for the interaction between treatments, Th X L 
(row 3) and for the effects of order of tasting within pairs (rows 5 to 8) 
decreased. Conversely, the “interaction” with tasters of each direct 
effect of treatment, Th and L, increased to a high degree of significance 
(rows 10 and 11). Both main effects in rows 1 and 2 were compared 
against these “interactions”, leading to somewhat smaller but still 
significant F values. Measuring the degree of preference has thus 
increased our information on certain key points. Relative to the 
variability among tasters, the direct effect for the fungicide (Th) and 
for the insecticide (L) were significant at the 1 percent and at the 2} 
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percent levels, respectively, so that the experiment was not needlessly 
over-size. 

Was anything gained by substituting rankits for the simple rank 
order, as measured by the digits —3 to 3? Rankits have the theoretical 
advantage of greater conformity with the model for the analysis of 
variance and are in line with the frequency distribution of degree of 
preference. For an empirical check, the analysis was recomputed with 
Scheffé’s original ranks, leading to the values for F in the last columns 
of Table 12. Both criteria gave substantially the same conclusions 
with few discrepancies. The effect of order upon the preference for 
thiram, for example, was apparently just significant (P = 0.05) in 
ranks, but not in rankits. Since the additional labor was not large, we 
would prefer a rankit analysis. 


_ Homogeneity of the variance among tasters 


In terms of simple preferences, the 25 tasters showed good internal 
consistency but with the introduction of degree of preference, the picture 
changed. The same variability appeared among individual tasters 
that has characterized other taste experiments in which samples are 
graded on a hypothetical scale. Among the 25 tasters participating in 
the present experiment, four found no difference in flavor which they 
would designate as more than “‘slight’”’; nine found one or more differ- 
ences which they classed as “large’’. For a more precise measure, an 
“error” sum of squares, [y’], with 30 degrees of freedom was computed 
for each taster from the variation in y within each paired comparison 
(Table 11). The total of the 25 sums of squares was equal to the error 
sum of squares in Table 12 (row 14) plus the nearly equal interactions 
with order and test in rows 5 to 9. Bartlett’s test indicated a highly 
significant heterogeneity in the variance among individual tasters, 
giving x° = 88.51 with 24 degrees of freedom, but since this test is 
-equally sensitive to non-normality (Box and Andersen, [1955]), both 
may have been involved. 

The distribution of log [y’] was relatively symmetrical, so that the 
discrepancy among tasters could not be traced to one or a few aberrant 
tndividuals. It was due more probably to discrepancies in scaling, 
ascribable to differences either in taste acuity or in the interpretation 
of the terms “slight”, “moderate”, and “large”. No transformation 
of grades is likely to correct this type of variability among individual 
tasters. Any resulting heterogeneity in the error variance is not be- 
lieved to have invalidated the analysis in Table 12 in view of its con- 
sistency with the analyses in Tables 7 and 9, where scaling was not a 
factor, and of the known robustness of the analysis of variance as a 
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technique. We believe, instead, that reporting the degree of prefer- 
ence increased the information provided by the experiment. 


A COMPARISON OF ANALYTICAL METHODS 


As noted above, the rankit transformation, or mean deviate for 
the number of choices, is based upon the same assumed normal distri- 
bution of sensations or preferences within individuals or between 
individuals as the normal deviate postulated by Thurstone and Mosteller. 
The transformation of degree of preference to rankits is a simple exten- 
sion of Scheffé’s analysis in least squares. A method which differs 
much more in procedure is that described by Bradley and Terry (1952a,b, 
1953), in which the squared hyperbolic secant density, corresponding 
to a logistic curve (Hopkins, [1954]; Gridgeman [1955]), is substituted 
for the normal density in Thurstone’s model. They assume that the 
ranks assigned to any given pair of stimuli are distributed binomially, 
and have developed a rank order technique and tables (Bradley and 
Terry [1952a]) for determining by x’ the significance of differences 
among the initial stimuli. 

The present experiment has been analyzed by the Bradley-Terry 
technique (Table 13), both with the test or replicate as the unit, and 


TABLE 13 
x? ANALYSIS OF SIMPLE PREFERENCES BY BRADLEY-TERRY TECHNIQUE 
Source D.f. x P. 
Overall comparison of treatments 3 32.550 .0005 
Effect of order of presentation 3 5.099 .16 
Other replicate X treatment interactions 12 9.182 .68 
Agreement of panel members 72 112.462 .005 


with the taster as the unit. The four combinations of the two insecti- 
cides and two fungicides in the present experiment gave a highly signifi- 
cant x” among treatments and a significant discrepancy in the response 
of panel members, but no effect either of the order of presentation of 
the two samples within each pair or of other interactions of treatments 
with replicates. Recently, Abelson and Bradley [1954] have extended 
this method to the factorial analysis of a 2 X 2 design, so as to isolate 
the individual components now grouped in the first row of Table 13. 
Their calculation is so involved, however, that its availability is limited. 
It has not been applied to the present data. 

Kendall [1955] has proposed still a different approach, based upon 
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the matrix of ‘votes’ cast for each competing treatment. Comparative 
ratings or scores can be determined either from the initial matrix of 
votes, or from the square of this matrix, which presumably reallocates 
the scores more equitably. His procedure has been applied to the total 
preferences in Table 4, with ratings not unlike those given by other 
techniques. 

All methods permit the linear scaling of treatment preferences. 
Their scales in the present experiment have been compared in Table 14 


TABLE 14 


ComPaRaTIVE ScaLE LOcATIONS OF THE TREATMENT COMBINATIONS IN TABLE 2, 
AND FacToriAL Errects COMPUTED WITH THE SPECIFIED FacTorRiaAL COEFFICIENTS 


Treatment combination | Factorial effect 
Row Scale 
Th LTh L (-—) | Th L ThXL 
1 {Mean normal deviate, Mosteller|.108 1.000 —.409 — .699/2.216 1.182 .602 
2 |Rankits for replicates (Table 6)}.110 1.000 —.410 —.700/2.220 1.180 .600 
3 |Rankits for tasters (Table 8) |.084 1.000 —.412 — .672/2.168 1.176 .656 
4 |Degree of preference, ranks .044 1.000 —.294 —.750/2.088 1.412 .500 
5 |Degree of preference, rankits |.062 1.000 —.293 —.770/2.125 1.415 .461 
6 |Ln p; , deviates, Bradley-Terry|.039 1.000 —.399 — .640|2.078 1.202 .720 
7 |Scores, Kendall matrix -111 1.000 —.426 — .685/2.222 1.148 .630 
8 |Scores, Kendall matrix squared |.181 1.000 —.441 —.740/2.362 1.118 .520 
Factorial coefficients Th 1 1 -1 -1 
for effect of * L -1 1 1 -1 
ThxL |-1 1 -1 1 


by dividing each score, expressed as a deviate from the mean score, by 
the score for the most preferred treatment (LTh). The Thurstone- 
Mosteller technique, applied to each replicate separately and then totalled 
over the six replicates, gave the ratings in row 1. Nearly the same results 
were obtained directly with rankits from the number of preferred 
choices in each replicate (row 2). With the data for each taster converted 
to the rankits for a series of 7 (instead of 26), the results diverged more 
from the other two (row 3). The scalings differed still more when the 
degree of preference was considered, whether expressed in the simple 
ranks described by Scheffé (row 4) or in rankits (row 5). The logarith- 
mic scale from the Bradley-Terry approach, based upon the total 
number of choices for each treatment, and scales based upon the scores 
from Kendall’s matrix have been added for comparison (rows 6-8). 
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The results for the four treatment combinations have been combined 
with the factorial coefficients in the lower part of Table 14 to obtain the 
relative measures of the effect of thiram, of lead arsenate, and of their 
interaction in the last three columns. The interaction was largest on 
the Bradley-Terry scale, which also gave the smallest effect for thiram 
and an intermediate value for lead arsenate. In this experiment all 
scalings agreed sufficiently, however, that convenience and compre- 
hensiveness would rank high in selecting a procedure. 

The techniques seemed to compare more or less as follows. If 
a paired comparison is based upon a simple preference and is replicated, 
analysis with rankits is simpler and more flexible than with normal 
deviates. But if the data represent a single test with many subjects, 
Mosteller’s [1951b] x” test for additivity may be necessary. There 
seems to be good reason for balancing an experiment so that it can 
be analyzed both with the replicate as the unit and separately with 
the subject as the unit. Additional information is gained by recording 
the degree of preference, despite heterogeneity among tasters in their 
error variances, and here we would prefer the, calculation in rankits. 
For practicality, the Bradley-Terry analysis depends upon special 
tables, which they have published, but it seems to allow fewer compari- 
sons than the other techniques. We have not studied the Kendall 
analysis sufficiently to warrant an opinion. 


SUMMARY 


Paired comparisons, in which each treatment is tested in pairs 
with every other treatment, are well adapted to comparative studies 
of the effects of chemically different pesticides upon the flavor of a food. 
The palatability of sauce made from apples, sprayed in a 2 X 2 factorial 
experiment with two different insecticides and two different fungicides, 
has been evaluated by this design. Six pairs of samples, representing 
all possible treatment combinations, were tasted on each of -six test 
days by the same 25 subjects, each subject recording both his choice 
within the pair and the degree of his preference as suggested by Scheffé. 

For analysis, the deviate of the normal curve in the Thurstone- 
Mosteller model was replaced by the rankit, defined as the mean deviate 
for the ranked items in a sample from a normal population with mean 
zero and unit standard deviation. Two analyses were computed from 
the number of choices for one of the treatments in each pair: (1) with 
the replicate as the unit and the rankits for a series of 26 as the variate, 
and (2) with the taster as the unit and the rankits for a series of 7 as 
the variate. The degree of preference reported for each test pair was 
also converted to rankits for a third analysis. 
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The three analyses of variance complemented one another. In 
every case, there was satisfactory additivity of treatment preferences 
on the rankit scale; the order of tasting the two samples comprising 
each pair had no significant effect upon preference; and the results on 
the six replicate or test days were mutually consistent. Sauce prepared 
from apples sprayed with lead arsenate was preferred to that sprayed 
with parathion, and, even more markedly, thiram was preferred to 
sulfur. The flavor associated with the insecticide did not modify that 
due to the fungicide, the interaction not being significant in any test. 
However, individual tasters differed in their flavor preferences for the 
direct effects of treatment, the “interaction” of tasters and insecticide 
being significant when the degree of preference was reported, and the 
‘““nteraction” of tasters and fungicide highly significant by both pro- 
cedures. Recording the degree of preference gave somewhat more 
information than the simple choice between the two samples of each 
pair, despite a significant heterogeneity in the variance within tasters. 

For comparison, the same data were analyzed in varying degree 
by four alternative methods: the Thurstone-Mosteller, Bradley-Terry, 
Kendall, and Scheffé techniques. The results have been compared in 
terms of the relative scale positions of the four treatments and of their 
factorial combinations. Despite differences in the underlying model 
and method of analysis, the treatment rankings on a preference scale 
were substantially the same. A choice among them in this case could 
be determined by convenience and the ability of the technique to extract 
the required information with the least computational labor. Rankit 
analysis seemed to meet this need effectively. 
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THEORETICAL AND EXPERIMENTAL STUDY 
OF SELF FERTILIZED POPULATIONS* 


W. HoRNER 
Agricultural Experiment Station, Ames, Iowa, U.S.A. 
AND 
CHar.es R. WEBER 
Field Crops Research Branch, A.R.S., U. S. Department of Agriculture 


1. INTRODUCTION 


Plant breeders frequently classify the material of a generation into 
groups according to the ancestry of the group. Replication of the groups 
permits the variance component due to that type of grouping to be 
estimated. Groups of one generation may be related to groups of 
another generation through common ancestry and hence covariances 
can be estimated. Such estimated quantities are functions of the 
underlying genetic mechanism and in particular of genotypic variances 
and covariances. They thus provide a basis for inference about the 
genetic mechanism. 

Genotypic variances and covariances have been defined and derived 


_ under various assumptions by numerous authors; notably Fisher [1932], 


Mather [1949], Horner [1955], Gates [1954] and Kempthorne [1956]. 
The definitions have been based on infinite populations. These may be 
divided into sub-populations whose genotypic composition and relative 
frequency are known from genetic theory. The actual magnitude of a 
genotypic variance or covariance depends on the value assigned to each 
genotype. This depends on the physiology of the expression of the 
genotype and is usually a function, simple or complicated, of the genes 
of the genotype. 

The relationship of sample covariances (or sample variance compo- 
nents) to genotypic variances and covariances is not always obvious. 
The expected values of the sample covariances may be a function of 
more than one genotypic covariance. The relationship is often further 
obscured by the fact that actual plant populations are finite and a 
define’ infinite subpopulation may be represented by only a few plants. 
The problem of relationship becomes particularly acute in populations 
which reproduce by self fertilization. These give rise to populations 
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having a hierarchical structure and hence to a whole set of genotypic 
variances and covariances. 

The objective in the present paper is to describe a formal procedure 
for obtaining the relationship of sample covariances and variance com- 
ponents to genotypic variances and covariances. This procedure is 
described with reference to an experiment conducted at the Iowa 
Agricultural Experiment Station on a soybean cross using data from 
the F, through the F; generation. 


2. DESCRIPTION OF SOYBEAN EXPERIMENT 


Homozygous soybean varieties Adams and Hawkeye were crossed in 
1947. Their F, was produced in 1948, their F, in 1949, etc. For the 
purposes of this study, soybeans are nearly completely self fertilized. 
The experiment has the structure indicated in Table 1. 


TABLE 1 
STRUCTURE OF THE DaTa 
Year Population Structure 
1949 F, Ae 
| 
1950 F; Bi By B; Bu 
| 

1951 F, Cc, CP Cy C; Ct Cu CR 
1952 Fs D, D* D, DF Di Dt Du Dz, 
1953 Fs E, E, E; Ey E%, 
1954 F, F, F* F, FF F; F* Fo Fe, 


Unselected progeny: *. Selected progeny: no star 


Ninety four random F, plants, indicated by A’s in Table 1, were 
selected. The 94 progenies in the F; generation, indicated by B’s, were 
grown in a simple lattice design together with parents and bulk popula- 
tions. Two plants were selected at random from each F; progeny. 
The 188 resulting F, progenies, which are indicated by C’s in Table 1, 
were again grown in a simple lattice design. To obtain the F; generation 
one of the two progenies in the F, generation tracing to each particular 
F, individual was selected at random. ‘Two random plants of each 
selected progeny produced F’; progenies. The cycle was then repeated. 
Selected progenies are indicated in Table | by the absence of a star. 
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The letters B through F in addition to being used to represent progenies 
will also be used to represent adjusted phenotypic progeny means, and 
the letter A will also be used to represent a measured value of an F, 
plant. 


TABLE 2 
SampLe CovaARIANCES AND VARIANCE COMPONENTS AND THEIR ExrecTeD VALUES 


Sample covariance Expected value 

A withB Cov (2; 2, 3) 

A * + Cov (2; 2, 4) 

A “ (D+ D*) Cov (2; 2, 5) 

A “ (E + E*) Cov (2; 2, 6) 

A “ (F + F*) Cov (2; 2, 7) 

B “ 4C + C*) Cov (2; 3, 4) + a Cov (3; 3, 4) — a Cov (2; 3, 4)tT 
B “ (D+ D*) Cov (2; 3, 5) + a Cov (3; 3, 5) — a Cov (2; 3, 5) 
B “ }(E + E*) Cov (2; 3, 6) + a Cov (3; 3, 6) — a Cov (2; 3, 6) 
B “ ¥F + F*) Cov (2; 3, 7) + a Cov (3; 3, 7) — a Cov (2; 3, 7) 
Cc “ (D+ D*) Cov (3; 4, 5) + 6b Cov (4; 4, 5) — b Cov (3; 4, 5) 
C “ (EF + E*) Cov (3; 4, 6) + b Cov (4; 4, 6) — b Cov (3; 4, 6) 
Cc * 3 + F*) Cov (3; 4, 7) + b Cov (4; 4, 7) — b Cov (3; 4, 7) 
D “ KE + E*) Cov (4; 5, 6) + c Cov (5; 5, 6) — c Cov (4; 5, 6) 
D “ KF + F*) Cov (4; 5, 7) + ¢ Cov (5; 5, 7) — ¢ Cov (4; 5, 7) 
E “ (FF + F*) Cov (5; 6, 7) + d Cov (6; 6, 7) — d Cov (5; 6, 7) 
c + Cov (2; 4, 5) 

ce “ (EB + E*) Cov (2; 4, 6) 

ce “ 4(F + F*) Cov (2; 4, 7) 

D* “ (E + E*) Cov (3; 5, 6) 

De “ 3 + F*) Cov (3; 5, 7) 

E* “ 3F + F*) Cov (4; 6, 7) 

Sample 

Year Population variance component Expected value 

1950 Fs Among A groups Cov (2; 3, 3)t 

1951 Fy Among B groups Cov (2; 4, 4) 

1951 Fy Within B groups Cov (3; 4, 4) — Cov (2; 4, 4)f 
1952 F; Among C groups Cov (3; 5, 5) 

1952 F; Within C groups Cov (4; 5, 5) — Cov (3; 5, 5)t 
1953 F, Among D groups Cov (4; 6, 6) 

1953 Fe Within D groups Cov (5; 6, 6) — Cov (4; 6, 6)T 
1954 F, Among E groups Cov (5; 7, 7) 

1954 F, Within E groups Cov (6; 7, 7) — Cov (5; 7, 7)t 


{Strictly speaking year X genotype interaction components should also be included in these 
expected values. 

tta, b, c and d are the reciprocals of the harmonic means of the numbers of plants in B, C, D, 
and E respectively. The harmoric means are around 100 and hence the terms involving these co- 
efficients are negligible. 
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In this experiment there were 21 sample covariances and 9 variance 
components which were unconfounded with environmental effects and 
which could be given a genetic interpretation. These are listed in 
Table 2 along with their expected values. 


3. INTERPRETATION OF 
GENOTYPIC VARIANCES AND COVARIANCES 


The meaning of the symbolization Cov (2; 2, 3) or in general 
Cov (k; n, n’) is indicated by Figure 1. Cov (k; n, n’) is the genotypic 
covariance of progenies in the nth generation from particular genotypes 
in the kth generation with progenies in the n’th generation from those 
same genotypes of the kth generation. 


FIGURE 1 
INTERPRETATION OF Cov (k; n, n’) 
Generation Genotypes and progenies 

n M 

n’ M’ 


M and M’ are genotypic progeny means of progenies in the n and n’ generations 
which are descended from genotypes in the kth generation. 


Cov (k; n, n’) = Cov (M, M’) 


All genotypic variances and covariances of populations having the 
hierarchical structure described can be expressed as linear functions of 
the Cov (k; n, n’). With this notation, the genotypic variance among 
F, plants is Cov (2; 2, 2), and the covariance of F, plants -with their 
progenies in the F, is Cov (2; 2, 3). The variance of F; progeny means 
is Cov (2; 3, 3) and the average variance within these progenies is 
[Cov (3; 3, 3) — Cov (2; 3, 3)]. 

The interpretation of the Cov (k; n, n’) is a logical consequence of 
the genetic assumptions. As an example of interpretation, consider 
Cov (3; 4, 5); that is, the genotypic covariance of progenies in the F, 
and F’, generations which trace to particular genotypes in the F; genera- 
tion. Letting p, represent the frequency of the ‘th genotype in the F; 


and letting y,. and y,,. represent the respective F, and F; progeny. 


means, then Cov (3; 4, 5) = Dos — y..)(Ys.. — y...) where y,. 
and y... are the genotypic means of the F, and F, generations. In this 
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notation the generation with which a symbol is associated is indicated 
by the number of subscripts of the symbol. Each successive generation 
has one more subscript. Taking the case of a single locus with two 
alleles, let the values of the three genotypes be z + 2u, z + wu + au 
and z. From genetic theory it is known that the frequencies of the three 
genotypes in the F’, are 3/8, 2/8 and 3/8. The genotypic progeny means 
in the F, generation are z + 2u, z + u + 1/2 au and 2; and in the F; 
generation, z + 2u, z + u + 1/4 au and z. Thus, Cov (3; 4, 5) = 
3/4u? + 3/128a7u? = (3/2)o% + (3/32)o> , where o% and gp are the 
additive genetic and dominance variances of the F, generation. 


4. DERIVATION OF THE COVARIANCE OF C WITH }(D + D*) 


4.1 Notation and model for an adjusted progeny mean: 


The covariance of C with 4(D + D*) is derived in this section to 
illustrate the derivation of covariances and variance components. For 
this purpose C’; will be represented by 


Vij 
n; 


where z;,; is the value of the jth plant in C; and n, is the number of 
plants in C;. For D; we will use 


Vij’. 


where 2, ;-, is the value of the kth plant in D; . The prime on the sub- 
script 7 denotes the fact that the D; progeny is from a particular plant 
in C; , the j’ plant say. The j’ plant is one of the two plants which was 
selected at random in C; . The other will be indicated by 7”. For 
D* we will use 


= 


The expected value of the sample covariance is 
E{Gov [C, (D + D*)]} 


1 ; 


i=1,2,---,n 


which can be evaluated most easily by making use of the equivalent 
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expression 
E{Gov [C, + D*)}} 


1) 2 BIPPY, say 
which thus reduces the problem to that of finding the expected covariance 
associated with individual degrees of freedom. 

The model for z;, is z;, = w, + 2. + €; where uw, and ¢ have the 
usual meanings and z;. is the mean of the genotypic effects in the C; 
progeny assumed to be random. The model for z,,-. is similar except 
that replaces and z,;-. replaces z;.. Strictly speaking the models 
should have an additional term for year X genotype interaction. For 
present purposes, this term is not considered. Only certain of the 
sample variance components would have their expectations changed 
under the more complex model. 


4.2 Genotypic covariance of observed progeny means: 


Assuming that the environmental effects of one year are uncorrelated 
with those of another year, then EPP’ reduces to 


EPP’ = 25°.) + 25+.) 
+ 2565.) + — 
— EQ 250.) — 25°.) 


By the procedure to be described subsequently it can be shown that the 
four terms entering with a plus sign are equal while the other four are 
zero and thus EPP’ = 2E(z,.z,;;-. ). This genotypic covariance of ob- 
served progeny means can be written as 


iMij’. 


Nn; 
which reduces to 


E(z,;2;;+4) is the genotypic covariance of a plant in C; with a plant in the- 
progeny ofasib. E(z;;-z,;-4) is the genotypic covariance of a plant in C,; 
with a plant in its own progeny. 
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4.3 Model for genotypic effect of an observed plant: 


The genotypic covariances of observed plants, E(z;;z;;-.) and 
E(z,;-2:;-.), can be related to population genotypic covariances. To do 
this let 5;) be a random variable that assumes the value one when the 
genotypic effect of the jth plant in C; corresponds to the genotypic 
effect of the uth genotype in the progeny of the fth genotype in the F, 
generation and zero otherwise. Likewise let 5/3‘ be a random variable 
which assumes the value one if the kth plant in the progeny from the 
jth plant in the progeny of C; corresponds to the genotypic effect of the 
vth genotype in the progeny from the wth genotype in the progeny from 
the ith genotype of the F, generation and zero otherwise. 

The properties of the 6’s are known from genetic theory. For example 
E(6:.) = P:Pi Where p, is the probability of the ‘th genotype in the 
F; generation and p,, is the probability of the uth genotype in the pro- 
geny of the ¢th genotype given the /th genotype. Similarly E(zii*) = 
and 


for 7’ ¥7 


for 1’ = i,t’ = t,j’ = j,andu’ =u 


otherwise 


The genotypic effects of observed plants are now related to popula- 
tion genotypic effects by the equations 


where y,, is the genotypic value of the uth genotype in the progeny of 


the ‘th genotype, y.. is the genotypic mean of the F; generation and 
Yiu» and y_.. have similar meanings. It follows that 


After simplifying the right hand side by making use of the properties 
of the 4’s, we have 


; 


Pye. — — Y...) 


Cov (3; 4, 5). 
In a similar manner it can be shown that 


E@@;;-2:;-x) = Cov (4; 4, 5). 
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4.4 Expected Value of Gov (C, (D + D*)/2}: 


This ais value is then 


However, 
F,and F 
an additio 
Cov (3; 4, 


E{Gov [C, 


where 6 is 


1) Cov 3;4,.5) +> (Cov (43 4, 


Cov (4; 4, 5) represents the total genotypic covariance of the 


5 generations. Henee Cov (4: 4, 5) contains Cov (3; 4, 5) plus 


nal covariance that might be represented as Cov (4; 4, 5) — 
5). Making this substitution and simplifying, we have 


3(D + D*)}} 
= Cov (8; 4, 5) + b[Cov (4; 4, 5) — Cov (8; 4, 5)] 


the reciprocal of the harmonic mean of the n; , which is the 


number of plants in C, . 


5. NUMERICAL EXAMPLE 


The derivations above have shown that the expected values of the 
sample covariances are linear functions of the Cov (k; n, n’); it remains 
to be indicated how actual genotypic models are fitted to the data. 
Accordingly, let Y represent a sample covariance. Then 


Y = Cov (k;n,n’) + « 


where ¢ is an error. 

For instance, if an additive model with dominance with additive X 
additive interaction, two alleles per locus, no linkage and a gene fre- 
quency of one-half is assumed, then it can be shown that 


k-1 k-1 _ 
Cov (k; n,n’) = (et + (72 + (4) 


If one makes other assumptions then Cov (k; n, n’) will be different. 


Assuming 


the above model, however, and defining 


2 2 2 
Bi = op, Bs = Gaa 


k-1 k-1 


the model for the sample covariance becomes 


and least 


and . 


Y= BX, + + +e 


squares procedures can be utilized to give estimates of 6, , 
Values of the X’s for the thirty sample covariances are 
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shown in Table 3 along with the sample covariances for the character 
maturity. This character maturity was the number of days after 
August 31 that 95 to 100% of the pods in a plot were ripe as measured 
by eye. 


TABLE 3 
OBSERVED AND EsTIMATED VALUES OF COVARIANCES 
Sample 
covariance| Estimated value 
Cov (2; 2, 3) 1.00 .50 1.00 9.7 | 10.9 11.6 11.5 
Cov (2; 2, 4) 1.00 .25 1.00 7.1 00.9 11.2 11:3 
Cov (2; 2, 5) 1,00 .13 1.00 7.3 | 10.9 11.0 11.1 
Cov (2; 2, 6) 1.00 .06 1.00 6.0 | 10.9 10.9 11.0 
Cov (2; 2, 7) 1.00 .03 1.00 7.6 | 10.9 10.9 11.0 
Cov (2; 3, 4) 100 1.00 13:2 | 10:9 2110 101.1 
Cov (2; 3, 5) 1.00 .06 1.00 14.2 | 10.9 10.9 11.0 
Cov (2; 3, 6) 1.00 .03 1.00 13.1 10.9 10.9 11.0 
Cov (2; 3, 7) 1.00 .02 1.00 13.8 10.9 10.9 11.0 
Cov (3; 4, 5) 1.50 .09 2.25 16.6 | 16.4 16.4 16.5 
Cov (3; 4, 6) 1.50 .05 2.25 13.9 | 16.4 16.3 16.4 
Cov (3; 4, 7) ‘1.50 .02 2.25 15.3 | 16.4 16.3 16.4 
Cov (4; 5, 6) 1.75 .05 3.06 18.0 | 19.1 19.1 19.1 
Cov (4; 5, 7) 1.75 03 3.06 20.2 | 19.1 19.0 19.1 
Cov (5; 6, 7) 1.88 .03 3.52 19.3 | 20.5 20.4 19.3 
Cov (2; 4, 5) 1.00 .03 1.00 11.8 10.9 10.9 11.0 
Cov (2; 4, 6) 1.00 .02 1.00 9.7 110.9 10.9 11.0 
Cov (2; 4, 7) 1.00 .01 1.00 11.4 | 10.9 10.9 11.0 
Cov (3; 5, 6) 1.50 ©2 2.25 17.0 | 16.4 16.3 16.4 
Cov (3; 5, 7) 1.50 .01 2.25 18.2 | 16.4 16.3 16.4 
Cov (4; 6, 7) 1.75 .01 3.06 16.9 | 19.1 19.0 19.1 
Cov (2; 3, 3) 1.00 . .25 1.00 19.8 | 10.9 11.2 11.2 
Cov (2; 4, 4) 1.00 .06 1.00 10.9 | 10.9 10.9 11.0 
Cov (3; 4, 4)—Cov (2; 4, 4) 50 .13 1.25 5.5 5.5 5.6 5.5 
Cov (3; 5, 5) 1.50 .05 2.25 20.0 | 16.4 16.3 16.4 
Cov (4; 5, 5) —Cov (3; 5, 5) 2 6 81 3.8 2:7 2.8 3.7 
Cov (4; 6, 6) 1.75 .03 3.06 17.1 94a 100 1.1 
Cov (5; 6, 6) —Cov (4; 6, 6) 13 .0 .45 3.0 1.4 1.4 1.4 
Cov (5; 7, 7) 1.88 01 3.52 21.8 | 20.5 20.4 20.4 
Cov (6; 7, 7) —Cov (5; 6, 6) .06 02 .24 2.4 
.9623 .9625 .9627 

Y, = 10.92N, Additive Model 

= 10.85N, + 1.43X2 Additive Model with dominance 

= 11.06N,; + 1.07N. — Additive Model with dominance and addi- 


tive X additive interaction 
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The three Y columns shown in Table 3 represent the population 
covariances as estimated from linear regression, associated with a 
completely additive model, an additive model with dominance, and an 
additive model with dominance and additive X additive interaction. 

A detailed interpretation of this set of data will not be given here. 
It might be noted, however, that the completely additive model fits 
this set of data as well as the two more complex models, explaining 
about 96% of the variation in the dependent variable. 

Confidence limits for the Cov (k; n, n’) and prediction limits for the 
sample covariances in the case of the completely additive model are 
shown in Table 4. In interpreting the data one must bear in mind 


TABLE 4 
CONFIDENCE AND PREDICTION LIMITS FOR THE COMEFLETELY ADDITIVE MoDEL 


Confidence limits for Prediction limits for 


Coefficient of o%, , population covariances sample covariances 
X; h lp 4 
.0625 .63 —4.9 6.3 
.125 1.26 1.47 —4.3 7.0 
25 2.53 2.94 —2.9 8.4 
5 5.05 5.87 — .2 | 
1.0 10.1 11.7 5.2 16.6 
1.5 15.2 17.6 10.6 22.1 
1.75 17.7 20.5 13.3 24.9 
1.875 18.9 22.0 14.6 26.3 


that certain of the estimates of the variance components may be biased 
upward because of the ignoring of genotype X year interaction. The 
estimates of the 21 sample covariances and the 9 sample variance 
components are not completely independent. Linkage may be an 
important factor in the early generations and natural selection pro- 
gressively more important in the latter generations. The F, data were 


individual plant values, while the data of other generations were based 
on plot values. 


SUMMARY 


This paper is concerned with the derivation of the expected values of 
sample covariances and variance components in terms of genotypic 
variances and covariances for populations produced from crossing two 
homozygous lines and subsequent self-fertilization. Such genotypic 
variances and covariances have been defined by authors including 
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Fisher, Mather, Horner and Kempthorne. Use is made of random 
variables such as 6; , where 6; = 1 if the genotype of the plant labeled 
z in an experiment corresponds to the /th genotype in a list of possible 
genotypes and zero otherwise. Application was made to maturity 
data of a soybean experiment. In this experiment 21 covariances and 
9 variance components, unconfounded with environmental effects, 
were estimated. A completely additive model fitted this set of data as 
well as did two more complex models, explaining 96% of the variation 
among the sample covariances and variance components. 
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IN CONTINUOUS VARIATION 


Brrcer 


Institutt for Plantekultur, Norges Landbrukshégskole, Vollebekk, Norway, and 
A.R.C. Unit of Biometrical Genetics, Department of Genetics 
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INTRODUCTION 


The principles upon which the biometrical analysis of continuous 
variation can be based, have been set up by Fisher et al. [1932] and 
Mather [1949a,b]. The method Mather gives allows the estimation of 
the additive (D), dominance (H) and environmental (FE, and E,) 
components of variation, and also the detection of linkage. Interac- 
tions, although not explicitly allowed for, can be detected as additional 
sources of error variation. 

As it has become obvious that interactions are one of the more 
important contributory factors of variation, general approaches allowing 
the estimation of these effects in different generations have been de- 
veloped by Cockerham [1954], Kempthorne [1955] and Hayman and 
Mather [1955]. The approach adopted by the last authors is extended 
in the present paper to deal with families raised after backcrossing an 
F, family to its F, and the two parents, with a view to determining 
the value of these generations for separating the effects of linkage and 
interaction. The resulting method has been tested by application to 
an experiment yielding data of this kind. 


EFFECTS OF INTERACTION AND LINKAGE IN 
BACKCROSSES OF F; TO F; AND PARENTS 


Contribution to first degree statistics 


Using the notation described by Hayman and Mather [1955] for 
expressing the effects of interaction between two genes, the mean 
expression of families raised by backcrossing F, to F, and parents 
(P,; and P,) is as shown in Table 1. The mean of F, is shown for com- 
parison. 

In the presence of linkage the mean values of F, and F, X F, ob- 
viously depend on interactions and the genic distribution between the 
actual parents, but when p= } these two effects disappear. 
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TABLE 1 
..FFECTS OF LINKAGE AND INTERACTION ON THE MEANS 


Generation Mean expression 


+ he} + + 31) avr? 

F2X Fi + hol — p) + 

X Py + dy + ha + he) + + gr) — + Jojo 
F. X P2 3[—da dy + ha + ho) + + gr) + + Joie + 


Where alternative signs appear, the upper refers to association and the lower to 
dispersion. 

p = recombination value, q= (1 —p), r = (1 — 2p). 

Notation and symbols used in this and subsequent tables follow Mather [1949a, b] 
and Hayman and Mather [1955). 


The mean expressions of F, X P’s are on the other hand dependent 
on interactions and the distribution of genes between the actual parents 
both in the presence and absence of linkage. The effects of association 
and dispersion appear as a change of sign in the interaction and additive 
terms. 

The expectations for the various scaling tests allowing for the effects 
of interactions but not linkage, have been given by Hayman and Mather 
[1955]. These have been extended in Table 2 to tests involving (i) all 
selfed generations up to F; and (ii) backcrosses of F, to F, and parents. 
The latter test is capable of detecting pure 7-interaction whereas the 
former also includes a portion of the /-type interaction. The table also 
shows two scaling tests based on second degree statistics. (cf. constancy 
of W-V in diallel cross analyses (Hayman [1954]; Jinks [1954]). 


Contribution of two interacting genes to second degree statistics. 


The contribution of two interacting genes to second degree statistics 
derived from backcrosses of F, is shown in Table 3. The corresponding 
expressions for other generations, considered later in this paper, are 
also included. To avoid terms involving products of additive and 
dominance effects the variances and covariances from the backcrosses 
of F, to parents are summed over P, and P, (cf. treatment of back- 
crosses, Mather, [1949] thus leaving three statistics, viz. the variances 
within and between families and the covariance with F, parents. Three 
corresponding statistics are provided by the backcross of F, to F, . 

The variances and covariance from F, X F, are simple in expression 
and obviously informative, the total effect of interactions appearing 
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as separate quadratic items and so being unaffected by sign. Further- 
more, V;ror; and Wy ror, include nothing but simple additive effects 
and i-interactions. They are therefore likely to provide a direct measure 
of the fixable variation. A further advantage of these statistics is their 
independence of genic distribution between the actual parents. 

In the case of summed variances and covariance from the back- 
crosses to the parents a portion of the effects of interactions still appears 
as separate items, and the covariance is unique in having all the effects 
of i- and /-interactions expressed in this way; whereas the total effect 
of the j-interactions is confounded with the additive component. 

The two variances have corresponding separate interaction expres- 
sions but different coefficients. They are, however, in complete agree- 
ment with regard to the compound terms where a portion of the j-inter- 
actions is confounded with the additive component and a portion of 
i-interaction with the dominance component. The statistics obtained 
from these backcrosses are of course dependent on the genic distribution 
in the parents. 


The contribution of two linked genes to second degree statistics. . 


The definitions of D and H in respect of two linked genes for different 
generations are given by Mather [1949]. Using his notation the expecta- 
tions of the additive and dominance components in the statistics 
obtained from the present backcrosses of F, are given in Table 4. 
Compared with linkage terms for D and H in other generations pre- 
sented earlier and also given in the table, these six new statistics require 
only one further definition of D and H for complete specification, viz. ° 
Vorori 


D = 3{d2 + d? + 2d,d,(1 — 2p)(1 — 3p)] 
and 
H = + hi + — — p)] 


ESTIMATION OF THE EFFECTS OF INTERACTION AND LINKAGE 


Separation of linkage and interaction effects 


The original analysis into D, H, E, and E, components of variation 
and the subsequent tests for linkage and residual interaction, do not 
include interactions as components, whereas a modified method 
proposed by Hayman and Mather [1955] takes interactions explicitly 
into account. They point out one important distinction between linkage 
and interactions in expression, viz. that in each mating system linkage 
terms change with rank of statistic but not with generation, whereas 
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interactions tend rather to cause heterogeneity between generations 
within ranks. They show, however, that in those generations where 
i-interactions do not affect the definitions of D and H this type of 
interaction may mimic linkage effects. 

The successful separation of linkage and interaction effects depends 
of course on their independence in expression. Accordingly the problem 
has been reexamined by calculating the correlation between linkage and 
interaction expressions for the statistics under the conditions outlined 
below. The fourteen statistics involved include the six under considera- 
tion, six deriving from selfed and biparental progenies of the third 
generation and the remaining two from F, and first backcrosses. 

Duplicate and complementary types of interactions are considered 
as defined by the following conditions: 


Duplicate: Complementary: 
d, = d, =h, = h, a — Fie) d, = d, =h, = h = Ba, 
= Jats lias = Jars = = 


A further instance is considered where the description of the dupli- 
cate type of interaction is modified to allow for the absence of dominance, 
ie. h, = h, = 0. 

In all cases the recombination value p = }. 

The results of these calculations are summarised in Table 5 and 


TABLE 5 
CORRELATION BETWEEN LINKAGE AND INTERACTION EXPRESSIONS 


Type of interaction 
Gene distribution 


“Duplicate” 
Duplicate Complementary no dominance 
Association 0.672 —0.599 0.187 
Dispersion —0.387 0.586 0.044 


indicate that under certain circumstances up to approximately 40% 
of variation legitimately attributable to interaction may be taken up 
by allowing for linkage and vice versa. 


The estimation of linkage and interaction 


Sets of data appropriate to the statistics under consideration can 
be analysed by the inclusive/exclusive form of analysis outlined by 
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Mather [1949] and Mather and Vines [1952]. Thus fitting least squares 
estimates of the four components D, H, and EF, and E, will yield a 
S.S. for deviations depending on the effects of linkage and interaction. 
After fitting extra parameters to allow explicitly for linkage, any 
significant reduction in this residual S.S. must indicate the-presence of 
linkage effects, provided there is no correlation between linkage and 


' interaction in expression. The S.S. remaining after allowing for linkage 


can be ascribed unambiguously to interaction effects. 

A correlation has however been demonstrated between the effects 
of corte iypen of interaction and those of linkage. Allowing for linkage 
in the presence of interactions and vice versa may therefore lead to an 
inflation of the primary effect at the expense of the residual variation. 
The solution obviously would be to carry out two analyses, one allowing 
explicitly for linkage and the other allowing explicitly for interactions. 
Then in each case the remainder item could be attributed unambiguously 
to the effect not accommodated. This would leave a balance of varia- 
tion (i.e. the difference between the inclusive residual S.S. and the sum 
of the two remainder items) which could not be assigned specifically 
to either effect. The complete specification of all digenic interaction 
effects requires however, nine extra parameters whereas linkage effects 
require but three (Tables 4 and 6). Therefore a sequential analysis 
of the kind given in Table 7 would seem easier and has in consequence 
been adopted. A + entry in the upper part of this table indicates a 
significant item as compared with basic error while +-+ indicates a 
mean square significantly greater than the residual item. The lower 
part of the table gives the interpretation. 

By allowing first for linkage no difficulty in interpretation arises in 
the cases shown in the first three columns and further analysis would 
not be justified. The fourth case is, however, obscure and in this 
instance the data should be reanalysed allowing explicitly for inter- 
actions in place of linkage. The final results are then interpreted 
according to the lower part of the table. 


ILLUSTRATION BY EXPERIMENTAL DATA 


Structure of the experiment 


All the data used in this account are from selfed generations and 
backcrosses of F, to F, and parents, derived from the cross between 
varieties 1 (denoted P,) and 5 (P.) of Nicotiana rustica. (Mather and 
Vines [1952]). 

In 1953 50 plants taken at random from the F, were each selfed 
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The mean heights and flowering timee are given in Table 5 
standard errors attached te the moans are calculated from the v 
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| Flowering time 
Generation | Height Goches) (days irom 1/7) 
P; 44.9 49.93 $2.1 + 0.95 
P; | 4.7 1.94 35.5 0. 
43.2 + 0.78 23.8 
16.3406 31.2 C.& 
Fy . 46.7 | $2.8 + 1.4 
P, j 64.7 34.4 + 0.78 
46.249.3 33.5 20.8% 
¥, X 63.8 + 9.46 34.8 + 0.46 
xX F, 45.% + | 34.8 + 5.51 


M average of Py and P,. 


Since in previous data from the same crosses, no sicnifenn: 


differences have been found, the results from reciprocal crosses 


pooled in this account. 


The results regarding the mean values for the teo characiers ; 
r finding rs, 


mn satisfactory agreement with the ea 
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Departure from additivity of the genes concerned 
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The analysis of variation in height and flowering time 


The method used in the analysis of the variation, as revealed by the 
different second degree statistics, is that described by Mather [1949]. 


Height. The components of variation D, H, E, and EF, are estimated 
by least squares from the twelve second degree statistics shown in 
Table 10. The estimated valucs are given in Table 11 and the result 
of the analysis of variance, based on differences between observed and 
expected values, in Table 12. 

The analysis does not present any difficulty in interpretation. Fit- 
iing the four main components accommodates the major portion of 
variation. The item for residual disturbances which contains other 
sources of variation, i.e. linkage and interactions, is not significantly 
greater than the basic error, indicating that any further analysis is 
unjustified. 

The results will be discussed further below. 


“lowering Time. After fitting the four main components, the S.S. for 
deviations is still significantly higher than the basic error. This indi- 
cates the presence of linkage or interactions, or of both effects. 

Linkage is accommodated by fitting D and H components according 
to their linkage terms (Table 4).. As Varo; is unique in respect of 
linkage, this statistic is allowed to take its own value in the estimation, 
leaving two D-components (D, and D,) and three H-components 
(H, ,H,and H,;). The result of the analysis is given in Table 13. The 
significant reduction in the residual 8.8. obtained in this way might be 
taken as suggesting the presence of linkage effects. However, since 
the significant remainder item indicating interaction effects is of the 
same order as the linkage item, linkage cannot be inferred on this 
evidence alone (see Table 7). 

Under these conditions the next step in the proposed sequential 
analysis is to allow specifically for interaction effects. The composition 
of the statistics available, in terms of additive, dominance, interaction 
ind environmental effects is shown in Table 6. It will be observed that 
the range of statistics does not allow the complete solution with the 
nine extra parameters required for the full specification of all digenic 
niteraction effects. IIence an analysis allowing for interactions in a 
-implified form has been undertaken. This simplified form of analysis 
's designed to include duplicate and complementary types of interaction . 
as defined in the section on separation of linkage and interaction effects. 
Where alternative signs appear according to distribution of genes, 
the association form is used. 

The result of the of varianec is in Table 13. As 
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TABLE 10 
OBSERVED VALUES FOR THE STATISTICS 


Height Flowering time 


Statistic 


Block Block 


Vir2 

Wirzrs 

Virs 

+ Vipop2 
+ 
Virori 

Wir 2F1 

Vors 

Voropi + Vorope 


SESESRS 


TABLE 11 
EstTIMATES OF COMPONENTS OF_ VARIATION IN HEIGHT 


28.14 + 2.01 
15.24 + 6.66 
14.06 + 0.97 
11.43 + 0.74 


Df. = 12 


TABLE 12 


ANALYSIS OF VARIANCE IN HEIGHT 


Item 


Total 

Main Components 
Residual disturb. 
Main comp. X block 
Residual block 
Duplicates 


428 
I Il I II 
30.23 30.22 40.98 59.59 
17.28 16.47 22.73 23.96 
29.06 29.73 54.71 67.92 
30.52 34.21 50.13 49.96 
11.39 9.96 25.73 23.00 
17.96 15. 24.27 27.41 
9.71 7. 11.95 14.15 
ua 22.98 20. 48.62 49.72 
Jee J 38.50 43. 75.32 74.85 
31.88 26. 55.18 69.14 
4 E, 11.97 13. 17.23 22.77 
E, 6.81 12. 12.03 9.81 
4 
D 
ind H 
E, 
E; 
7 
Df. Ms. P 
24 
4 3383.04 <0.001 
8 10.22 >0.05 
4 5.27 
4.91 
| 12 5.03 


UT JOIIO Jo SMOYS 4894 


sazvordng 

yoojq X 62° X “prsey 
yoolq X yooyq X 
X 89° SF X 
FE 


4 
g 


N 


“ 


‘SW td 


(SISA[BUY 
ONIUAMOTY NI AONVIUVA dO SISAIVNY 


RACTIONS AND LINKAGI 


4 


INTE 


| 
vgs 
oo 
= 
4 
> 
| 
‘a 
| : 
| 
: 
he 
fa) 
2 


430 BIOMETRICS, DECEMBER 1956 


in the previous case, the analysis is based on the deviations of observed 
values from expected for the twelve second degree statistics available, 
the expected values being calculated from the exclusive components 
(Table 14). It can be seen that the disturbances remaining after evalu- 


TABLE 14 


EsTImMATEs OF ExcLustvE COMPONENTS OF VARIATION IN FLOWERING TIME 
(Exc ustve Estimation) 


Allowing for linkage Allowing for interactions 
D, 57.68 + 6.29 D 16.89 + 13.25 
D; 134.61 + 30.58 D, —22.86 + 35.15 
mM 4.66 + 25.67 H —28.28 + 49.56 
Hz —36.67 + 62.16 mM 125.62 + 42.73 
H; —22.18 + 24.78 I 59.67 + 22.92 
E, 20.13 + 3.70 22.26 + 3.06 
E, 20.58 4+ 2.53 E, 15.35 + 2.43 
DA. = 12 


Definitions of D and H components are shown in Tables 4 and 6. 


ating the effects of interaction are still significant. The results are 
discussed in the next section. 


DISCUSSION 


The height data from this experiment show a relatively large and 
highly significant additive component, indicating that although the 
phenotypic difference between the parents is barely significant, the 
genotypes are by no means alike. Furthermore, a rather strong sug- 
gestion of heterosis in F, combined with the absence of non-additive 
gene effects supports the view that genes affecting the character under 
consideration are present dispersed among the homozygous parents. 
The significant dominance component H and the dominance expression 
VH/D = 0.74 suggests a relatively high degree of dominance, but 
not overdominance. 

Mather and Vines [1952] in an earlier experiment involving different 
generations from the same cross found evidence of non-allelic inter- 
actions in the height data, the experimental material being at this time 
grown at Merton, South London. Their experiment was continued, 
however, at Birmingham, where no suggestion of genic interactions 
could be detected in the same material, which is in accordance with the 
present results (also obtained at Birmingham). This apparent change 
in the properties of individual genes has been discussed by Breese [1954]. 

The presence of residual disturbances affects in general the absolute 


LSS. 
| 


INTERACTIONS AND LINKAGE 431 


and relative magnitude of D and JI as estimated, and thus also the 
degree of dominance. A comparison of dominance relationship can, 
however, be made where the effects of interactions are removed. Diallel 
data which included P, and other varieties provide such an estimate of 
V H/D which indicates complete dominance (Jinks [1956]). This is 
in satisfactory agreement with the present result. 


Flowering time. The indication by the scaling tests of the presence of 
interactions is confirmed by the analyses of variance where allowance 
is made for linkage and interactions respectively, in the former analysis 
by a significant residual mean square and in the latter by significant 
interaction components. 

Less confidence can, however, be placed on the suggestions of 
linkage effects in these analyses. In the first analysis, the linkage item 
is not significantly greater than the residual mean square and so cannot 
be regarded as anything more than a reflection of interaction effects 
(see also Mather and Vines [1952]). In the second analysis, since only 
a simplified form of interaction is considered, the significant remainder 
item still cannot be related to linkage any more than to unaccommo- 
dated interaction effects. Furthermore, allowing for interactions in the 
form specified reduces the residual disturbances item by a greater 
amount than does allowing for linkage. Once again the presence of 
interactions rather than linkage effects is suggested, although the 
latter cannot be entirely ruled out. ; 

As the presence of significant residual disturbances affects the 
size of the additive and dominance components little weight can be 
placed upon the estimated values in Table 14. The insignificance of 
D (= > @) is probably due to large error whereas the apparent dis- 
agreement between H, (= >> h,l) and H (= >> 22) emphasises the 
inadequacy of the underlying assumptions. 

The negative value of D, (= >> d,j), although not significant, 
suggests the presence of a duplicate type of interaction. The presence 
of interactions in flowering time in Nicofiana rustica; probably of a 
duplicate type, is also found in data from diallel crosses (Jinks, [1954], 
Breese, unpubl.). Mather and Vines [1952] did not find similar effects, 
but their data showed a significant mean square for linkage. Different 
manifestations of genes in changing environments may, however, well 
be the reason’ why data disagree on this point (Breese, [1954]). 


SUMMARY 
Following the methods of Mather [1949a,b] and Hayman and 
Mather [1955] formulations including additive, dominance, linkage and 


interaction terms have been obtained for first and second degree statistics 
deriving from a backcross of an F, family to its F,; and parents. The 
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experimental recognition of each effect is discussed and the conditions 
necessary for the separation of interaction and linkage considered. A 
procedure which enables the discrimination of these latter effects is 
outlined. 


An experiment on Nicofiana rustica set up to illustrate the methods 
has been analysed for the two characters measured (height and flowering 
time). 

In height there is no suggestion of heritable effects other than addi- 
tivity and dominance. On the other hand, analyses involving both 
first and second degree statistics strongly suggest the presence of non- 
allelic interactions in flowering time. Since the number of available 
statistics is not sufficient for the simultaneous assessment of all digenic 
interactions and linkage effects, complete separation of these effects 
is not obtained and the detection of linkage is subject to uncertainty. 


ACKNOWLEDGEMENTS 


I wish to thank Professor K. Mather for supplying facilities and 
data and for helpful criticism. I am also indebted to Dr. Breese, Dr. 
Hayman and Dr. Jinks for assistance throughout the work. 


REFERENCES 


Breese, F. L. [1954]. Genotype-environment interaction. Ph.D. Thesis. University of 
Birmingham. 

Cavalli, L. L. [1952]. An analysis of linkage in quantitative inheritance. 135-144. 
Quantitative inheritance. (ed. FE. C. R. Reeve and C. H. Waddington). H.MLS.O., 
London. 

Cockerham, C. C. [1954]. An extension of the concept of partitioning hereditary 
variance of analysis of covariances among relatives when epistasis is present. 
Genetics, 39: 859-89. 

Fisher, R. A., Immer, F. R. and Tedin, O. [1932]. The genetical interpretation of 
statistics of the third degree in the study of quantitative inheritance. Genetics, 17: 
107-24. 

Hayman, B. I. [1954]. The theory and analysis of diallel crosses. Genetics, 39: 
789-809. 

Hayman, B. I. and Mather, K. [1955]. The description of genic interactions in 
continuous variation. Biometrics, 10: 69-82. 

Jinks, J. L. }1954]. The analysis of continuous variation in a diallel cross of Nicotiana 
rustica varieties. Genetics, 39: 767-788. 

Jinks, J. L. [1956]. The F, and backcross generations from a set of diallel crosses. 
Heredity, 10: 1-30. 

Kempthorne, O. [1955]. The theoretical values of correlations between relatives in 
random populations. Genetics, 40: 153-167. 

Mather, K. [1949a]. Biometrical yenetics. Methuen and Co., Ltd., London. 

Mather, K. [1949b]. The genetical theory of continuous variation. Proc. 8th Int. 
Cong. Genetics. Hereditas, Suppl. Vol., pp. 376-401. 

Mather, K. and Vines, A. [1952]. The inheritance of height and flowering time in a 
cross of Nicotiana rustica. Quantitative Inheritance. (ed. E. C. R. Reeve and C. H. 
Waddington). pp. 49-80. H.M.S.0. London. 


i 
| 
} 
4 
| 


APPLICATIONS OF THE k, STATISTIC TO GENETIC 
VARIANCE COMPONENT ANALYSES* 


D. Rosson 
Cornell University, Ithaca, N. Y., U.S.A. 
1. INTRODUCTION AND SUMMARY 


The distinguishing property of the k, statistic is that it estimates the 
fourth population cumulant. If {Y,;},7 = 1, --- , n is a sample of n 
independent observations on the chance variable Y then the statistic 


n = 


has the expectation 
Ek, Y,;) = E(Y — — 3{E(Y — = K,(y) 


This paper will present two essentially distinct applications of this 
property of the k, statistic to genetic variance component analysis, 
(i) in the unbiased estimation of the variance of estimates of variance 
components and (ii) in the estimation of the number of genes or factors 
controlling the inheritance of a quantitative character under the addi- 
tive model with dominance. 


2. UNBIASED ESTIMATION OF THE 
VARIANCE OF ESTIMATES OF VARIANCE COMPONENTS 


The fact that k, may be used to construct an unbiased estimate of 
the variance of a sample variance is well known and follows from the 
fact that the statistic 


(Y; 9)” 


has variance 


(1) Ver) = + 2 Ky). 


*Paper No. 332 from the Department of Plant Breeding, Cornell University. 
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So the statistic 


(2) 


n+1 
has the expectation 
= Visv.) 


This result is exact and holds for any probability distribution, the only 
assumption being that Y, , Y,., --- , Y,, are independent and identically 
distributed. We shall now indicate the extension of this result to the 
estimation of the variance of variance component estimates for a nested 
classification typical of the inbreeding experiment. 


2.1 The one-way classification in general 


The statistical model upon which our genetic variance component 
analysis is based is the following. A collection (7, , 7. , +++ , x) of 
probability distributions 2,(x) of the chance variable X are mixed 
according to the probability distribution p = (p, , po, +-* , Par) to 
form the mixed distribution z, , 


A sample of mn observations from this mixed population is taken 
according to the following sampling plan: m populations are independ- 
ently selected according to the distribution p and from each selected 
population n independent observations on X are taken. This model 
describes the situation, for example, where m F’, individuals are selected 
without regard to phenotype and then selfed to obtain n F; progeny 
from each selected F, ; the chance variable X in this case represents 
phenotype measured in the F; generation. 

The mean value p of the mixed population z, is simply the weighted 
mean of the individual population means, , , , Bar thus, 


M 
- 
1 
Likewise, the variance o” in the mixed population ean be written 


M M 
> + Pier — 


where a; is the varianee of X in population x, . More briefly, we write 
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where 
M 
1 


M 


With this notation the variance component analysis based on the mn ob- 
servations X,;; ,7 = 1, --- ,m,j = 1, --+ , n, may be written as in 


ll 


Table 1. 
TABLE 1 

VARIANCE COMPONENT ANALYSIS FOR THE ONE-Way CLASSIFICATION 

“= i 
Source of variation D.f. | M.s. Mean square expectation 
Between populations m—1 M, a? + nor 
Within populations m(n — 1) Me a 

M, —.M, 

Variance component estimates: M,, 3 = — 


The variance component estimates listed in Table 1 are obviously 
unbiased and their variances are easily computed. 


m (X;; #,)° 


1 
T(=2\ _ i=) 
V@)=V mo n—-1 
or letting 
(Xi; — 
:) 
Vis’) = 


where si , s2, *:* , 8m are independent and identically distributed. 


Hence, 
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Vis’) 
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where 
M 1 
fx 1 on + nr K 
- 2 le 
and 


= Salo? — 


An obvious unbiased estimate of 3’) is given by 


(3) 


The variance of the estimate s, is complicated by the fact that, in 
general, the mean squares M, and M, are not independent. Thus, 


= : M, :) 


{V(M,) + ViM.) 2Cov (M, , 


The mean square M, may be written 


where Z, , # , -** , £, are independent and identically distributed; 
hence, by (1), 


V(M,) = + Ke} 


where 


= («: + 


= Kulu) + Cov ~ 
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An unbiased estimate of V(M/,) is given by applying equation (2) to the 
sample.means , , ; thus, 


9 2 M, 


The covariance of the mean squares /, and M, can be shown to be 


( 
Cov (M, , M,) = m — 


3 2 l= 


and an unbiased estimate of this covariance is 


Cov (M, , M,,) = — — (@; — 


An unbiased estimate of V(s;) is therefore 
(4) Dist) = + — 2 Cov (M, , 


It should also be noted that an unbiased estimate of the variance among 
the “within population” variances, V(o;), which is of interest in some 
genetic studies, is available in the form 


2.2 The One-Way Classification in Genetics 


In the genetic situation where X measures phenotype the variance 
of X is sometimes further partitioned into genetic and environmental 
components. A third component of variance could be introduced into 
the general one-way classification simply by introducing a mixture q 
of populations of the form z, which, in the genetic model, could be 
paralleled by a third sampling stage in which each of the mn F; organisms 
is replicated r times. Such a scheme, however, would require that the 
organisms reproduce asexually in order to obtain r replicates of any 
given individual. We shall assume, instead, that the scale on which 
the phenotype X is measured is so chosen as to make environmental 
effects independent of genotype. In this case we may, in addition, 
suppose that the » individuals of each family appear together in an 
experimental unit or plot, since plot effects, which are environmental, 
are then independent of family effects. In an actual experiment. this 
assumption of independence of environmental and genetic effects would 
be tested by including in the experiment two or more replicates, or 
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plots, of several genetically different non-segregating generations such 
as P, , P, , and F, . If r plots of a nonsegregating generation and m 
plots consisting of m independently selected F;, families appear in an 
experiment then the variance component analysis takes the form given 
in Table 2. 


TABLE 2 
VARIANCE COMPONENT ANALYSIS FOR THE GENETIC ONE-Way CLASSIFICATION 


Source of variation a. .s. |Mean square expectation 


Between F; families a + + 02) 
Within F; families 

Between reps of nonsegr. generation 
Within reps of nonsegr. generation 


Variance component estimates: 


M., 


» _ (Mi M2 (M, 


n 


Mi M., 


The average environmental variance within plots is denoted by ¢. and 
the environmental variance among plots (totals of n items) is then 
denoted by nc + naz. The component 4; denotes the average within 
family genetic variance and no; + na, denotes the genetic variance 
among family totals based on n independently selected progeny. 
Unbiased estimates of the variance of the estimates given in Table 2 
may be constructed by essentially the same argument used before, 
since the mean squares (M, , M,) are independent of (Mj, MZ). For 
example, j 


V@) = + V(M.,) 
where V(MZ) and V(M,) may each be estimated by (3), and 
Vie) = (VO, — MD + VOM, 


where V(M{'— MZ) and V(M, — M,) may each be estimated by (4). 


2.3 Extension to a Nested Ciassification 


The extension of the general one-way classification variance com- 
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ponent analysis to a general hierarchal classification may be indicated 
by the introduction of a third level to the model underlying Table 1. 
As mentioned earlier, this may be accomplished by introducing a mix- 
ture g = , G2, , Of populations of the form of 7, ; i.e., of 
populations which are themselves constructed by mixing a group of 
populations. If the sampling plan for this model consists of (i) the 
independent selection of r groups, (ii) the independent selection of m 
populations from each of the r selected groups, and (iii) n independent 
observations on X from each of the rm selected populations, then the 
variance component analysis may be written as in Table 3. 


TABLE 3 
VARIANCE COMPONENT ANALYSIS FOR THE 3-LEVEL NESTED CLASSIFICATION 


Source Dif. M.s. |Mean square expectation 


Between groups of populations r-1 M, a + na? + mnoZ 
Between populations within groups Ms a + naz 
Within populations within groups 


Variance component estimates: 


The variance components listed in Table 3 are defined by 
R Mi 

R Mi Mé 2 


where u,; is the mean of population z,; , o;; is the variance in population 


ll 


™:;, Pi = (Pir, *** y Pim.) is the mixing distribution defined on the set 
, to form the mixed population pies; and 
= (4, °°: , 9x) is the mixing distribution defined on the set 


Unbiased estimates of the variance of the estimates given in Table 3 
may be constructed by extending the arguments which Jed to the un- 
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biased estimates (3) and (4). An extension of the argument leading 
to (3) is obtained by noting that the statistics 


= 77 i=1,-: 
n—1 


satisfy a one-way classification model entirely similar to that of Table 1, 


so that an analysis of variance of the s;; (Table 4) yields an unbiased 
estimate of the variance of &. 


TABLE 4 


VaRIANCE CoMPONENT ANALYSIS OF THE “WITHIN POPULATION” 
SAMPLE VARIANCES 


Source D.f. M.s. |Mean square expectation 
Among groups Visi) + mV (3?) 
Within groups r(m—1)| V(s?;) 
= 2 -2 S, S, 
Variance component estimates: Vis;;) = 8S, Vie) = en 


The interpretation of the variance components listed in Table 4 is as 
follows. 


Vi) = + 


According to this notation, then, V(M,) = V(s?;) + V(é’), so that an 
unbiased estimate of V(5’) may be computed as 


S, 
m 


V@) = S,+ 


An unbiased estimate of V(M,) may be obtained directly from (3) as 


j=1 — j=} 
rr — 1) m—1 r(m — 1) 
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Likewise, an unbiased estimate of V(J/,) may be obtained directly 


from (2), 


If M,, , M,,, denote the between and within population mean squares 
respectively for group 7 then M, = (1/r) p> M,, and M, = (1/r) > 
M,,,, 80 that the covariance of M, and M, is simply (1/1) Cov (M;, , 
M,,) and is estimated without bias by . 


Cov (M, , M.) = (= M,,M., — 
The covariance of M, and M, may be estimated without bias by 
Cov (M, ’ M,) = (r — 2) #)’ 


since the ¢;; of the present model satisfy the independence requirements 
of the X,,; of the earlier one-way classification. 


3. ESTIMATION OF GENE NUMBER 


An aid to the interpretation of genetic variance components is 
provided by the various genetic models that have been proposed for 
describing quantitative inheritance. We shall consider the additive 
model with dominance as described by Fisher et al. [1932] and later 
expanded upon by Mather [1949]. According to this model there is 
independence of gene action between loci but not between the two 
alleles at the same locus. The difference in effect between the two 
homozygous genotypes at the 7th locus is denoted by 2d; , and the 
deviation of the effect of the heterozygous genotype from the mean 
effect of the two homozygotes is denoted by h; . If P, and P, are two 
pure (homozygous) lines whose genotypes with respect to the given 
quantitative character differ at N loci then if Pp, , P, denote the pheno- 
typic means of P, and P, , respectively, we may write 


1(P, — P,) = ad, 


where 6; is +1 depending upon which of the two homozygous allelic 
pairs appears at the ith locus in the pure line P, . Similarly, if F, 
denotes the mean phenotype in the F, population obtained from the 
cross P, X P, then 


N 
— + P.) = hi, 
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and unless further complications arise 
= HPL +P) +4 Dh 
B, = 3 Dh 
B, = +P.) +3 Dh 
and so on for the various derived generations. Furthermore, in the 
absence of linkage the various genetic moments of interest may be 
expressed as linear functions of the parameters >> d; , >> hi, >. 6.d;h, , 
> di, hi, . For example, in the F, generation 
Dd -4 
in the first backcross generation B, = F, X P, 
+4 Dd aah 

Ks, = + ad), 

while in the B, generation 
- 4D sah 

K,, = —3 (h 
In the F, generation we would have 

=e Dh't+ Dd’. 


If the above interpretation is given to the various genetic moments 
then it is clear that a genetic experiment may be designed which, when 
analyzed by the methods of Section 2, will yield estimates of all the 
parameters described above for the additive model with dominance. 
It is then possible to estimate a lower bound on the number 2N of 
segregating alleles. 

Mather [1949] discusses several methods of estimating N which 
are based on the general inequality >>) yi — (1/N)(3>1 y,)? > 0, or 
N > (01 y’. For example, he suggests the estimates 

6d)” ( >» h)? 


=> 
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or, as a second alternative, 
Mem, 


EP 
where d+ h° estimates the average genetic variance ¢; within 


F, families and >> d* + h* + 3s estimates the variance 
V(o;,) among the within F, family variances. Estimates similar to 
N, and N, could be computed from other generations as well; in each 
case the quantity being directly estimated has the general form 


(a5,d; + | 


N, =- N 
> (a6,d; + bh,)’ 
bP (ad? + ont | 
N,==+ 
(ad; + bhi)? 


where a and b are given constants, and the lower bound property is 
based on the inequality yi — y,)* > 0. 

Lower bounds superior to N, and N, , respectively, may be con- 
structed by utilizing the general inequality 


Thus, letting x; = 6,d; and y; = h, we have 


a 
| 
N 1 N 2 
zy | N 1 ( N 2 N 1 N 
2 2 
which is equivalent to ates 
N N 2 N 2 sN N N N 
2 2 
1 1 1 1 1 1 
Ny = — N N N 2 <N L Pty 
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which may be shown to reduce to the inequality 


[a Sh DY Sd — — Yh > 0. 


Likewise, then, letting x; = d; , y; = h; we have 


<N. 
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NOTE ON WALD’S METHOD OF FITTING A STRAIGHT LINE 
WHEN BOTH VARIABLES ARE SUBJECT TO ERROR 


E. 8. Kerrine 
University of Alberta, Edmonton, Canada 


1. Introduction 


If x and y are correlated stochastic variables measured on individuals 
forming a random sample, the ordinary regression equation Y = a + bz 
is the best linear predicting equation for y, given x. The other regression 
equation X = a’ + Db’ y is the best linear predicting equation for z, - 
given y. If certain x values are selected by the experimenter, the first 
equation must be used not only for predicting y given x, but also (in 
inverted form) for predicting xz, given y. Neither equation, however, 
gives as a rule the best structural relationship between z and y. A. Wald 
[1940] has described one simple and useful method for calculating such 
a relationship and for estimating the error variances of x and y. It 
has been noticed in a text-book exercise (Kenney and Keeping [1954]) 
that these variance estimates may turn out to be negative, and it is of 
some interest to consider why. 


2. The Wald method 
Wald’s assumptions are: 


(i) = X; te ,y; = Y; + 2;,1,j = 1,2, --- ,N, where X,, Y; 
are the expected values, and ¢; , n; the errors, for x; and y; respectively; 
(ii) the e; are uncorrelated and have a finite variance o. ; the 7; are 
uncorrelated and have a finite variance o; ; the e, and 7; are uncorrelated; 
(iii) Y; = a + BX; ; 

(iv) N is even (= 2m), for simplicity; 
(v) the e; are small enough so that’ the ordering of the observations 
according to increasing xz; and according to increasing X; are not 
substantially different. 

If the observations are ordered so that 


S% Say, 
consistent estimates of a and 6 are given by 
B= — — 4%), 
— bi, 
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where 


m N 
z,/m, i, = xz;/m, & = + #,)/2, (3) 
and similarly for the y’s. The points (, , 9,) and (# , J.) on a scatter 
diagram of observations are the centres of gravity of two groups of 
dots, each comprising half the observations. The line of best fit joins 
these two points. 
Consistent estimates of o% and o; are given by 


= — s,,/8)/(N — 1) (4) 
6, = N(s; — Bs,,)/(N — 1) (5) 


where s; , S, , Sz, are respectively the variances of x and y and the co- 
variance. 


3. Numerical example 


In the text-book exercise referred to, where x and y have the follow- 
ing values: 


z |40 41 43 45 47 50 68 77 80 90 100 100 


y 65 60 43 63 85 60 48 56 74 53 91 98 
we find = 264 but < 0. 
In the ordinary regression equations the slopes b, 6’, are given by 
b=s,,/s;, = 8,,/s (6) 
so that 
(N — = Nsi(1 — b/8), 
(N — = — b’B). 
The conditions for useful estimates are therefore 
0<b<f<1/b’ or 1/ <8 <b<0 (7) 


which mean that the Wald line should lie between the two regression 
lines. This is not true in the above example. 

As Wald has pointed out, alternative estimates of o2 and o; with 
N — 2 instead of N — 1 degrees of freedom, are obtainable from the 
variances and covariances calculated from the two groups of observa- 
tions taken separately. These estimates have the advantage of being 
distributed (when ¢ and » are normal) independently of the estimated 
slope 


WALD’S METHOD 


They are given by 


(N = 2)6 N(s?? (8) 
(N — = — 81,8), (9) 


where 
m N 
Ns?” (x; — + (x; 2)" 
1 m+1 


and similarly for the others. By analysing the variances and covariances 
into portions within groups and between groups it is easy to see that 
these estimates are just those which we should naturally make, using 
mean squares and mean product within groups. 


Variance of x S. s. Des. M. s. 


Within groups Ns? N-2 92. 
Between groups (N/4)(#, — 1 5167 


Total Ns N-1 553 


‘Variance of y S. s. D. f. M. s. 


Within groups Ni” N-2 311 
Between groups (N/4)(g. — 9)” 1 161 
Total Ns N-1 298 


Covariance 8. p. D. f. M. p. 


Within groups 119 
Between groups (N/4)(@. — %,)(G2 — jj) 1 913 


Total N-1 191 


The numerical values given in the last column are for the example 
cited above. They yield ¢; = 291, 6? < 0. 


4. Discussion 


The quantities b, b’ and 8 are all ratios of a mean product to a mean 
square, (total cov.)/(total var. x), (total cov.)/(total var. y) and (‘‘be- 
tween” cov.)/(“between” var. x) respectively. The failure of the 
Wald estimate for ¢¢ is therefore seen to be due to the small covariance 
between groups, compared with the corresponding variance for z, 
which in turn is due to the small ‘‘between” variance of y (the covariance 
| 


i 
é 
F 
a3 
he 
| 
| 


448 BIOMETRICS, DECEMBER 1956 


being the geometric mean of the x and y variances). The 90% confidence 
limits for 8, calculated by Wald’s method, in fact include zero. 

An estimate of é¢ , though probably too large, can be obtained from 
the ‘‘variance of estimate” about the x on y regression line. This 
estimate is Ns2(1 — r’)/(N — 2) = 474. If the distiibution of the 
true values X; is known, an estimate can be calculated from the relation 


8: = sx + — 1)/N. (10) 


Thus if we suppose that the true values X,; are uniformly spaced at an 
interval h, the variance sy is h?(4m? — 1)/12. If in the example above 
we take h = 5, X; ranging from 37.5 to 92.5, the estimate obtained is 
é = 228. 

It is clear that in this example the fifth of Wald’s assumptions is 
not fulfilled, the e; being much too large. The simplicity of the method 
may easily lead to its use in situations where this use is not warranted. 
In such situations, however, it is a waste of time, except as a mere 
exercise, to attempt to fit any line at all, and little confidence could 
be placed in any prediction based on such a line. 
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CORRECTION TO “THE USE OF MITSCHERLICH’S REGRES- 
SION LAW IN THE ANALYSIS OF EXPERIMENTS WITH 
FERTILIZERS” 

FREDERICO PIMENTEL GOMES 
University of Sao Paulo, Brazil 


Mr. I. R. Nogueira (Escola Superior de Agricultura “Luiz de 
Queiroz’”’, University of S. Paulo) and Mr. V. N. Murty (Central 
Tobacco Research Institute, Rajahmundry, India) have independently 
pointed out that the formulae given for F,,, on page 508 and for F,, on 
page 509 of this journal, Vol. 9 (1953) should have specified 


= (2 — 4r + — 
F,, = Q@(1 — + 4r + 6 + 4+ 3r') 


The author wishes to note also that the last equation on page 505, 
- and the first on page 506, should be respectively 


dyi+1 A,.:cH (1/log dz 
A,.,weH (1/log dz. 
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INTERNATIONAL BIOMETRIC SYMPOSIUM ON “THE ROLE 
OF BIOMETRIC TECHNIQUES IN BIOLOGICAL RESEARCH” 


TECHNICAL PROCEEDINGS* 


Instituto de Educacao Carlos Gomes, Campinas, Brazil, July 4-9, 1955 


RECENT ADVANCES IN BIOMETRY IN JAPAN 


Epitep sy M. MasuyaMa 
Institute of Physical Therapy and Internal Medicine, Tokyo University 
AND 
: M. HatTamMura 
Chief of Section of Physics & Statistics, National Institute of Agricultural Sciences; 
National Secretary, Japan, Biometric Society 


We summarize here the present status in Japan of recent studies of 
biostatistical techniques, and of their applications. 


BOOKS AND PERIODICALS 


Several books for statisticians, biometricians and various types 
of agricultural technicians have been published recently. For example: 
i) T. Kitagawa and M. Mitome [1953], Tables for the design of factorial 
experiments,** which is the first extensive collection of various designs 
thus far published and unpublished; ii) T. Kitagawa and M. Masu- 
yama [1952], Séatistical tables (revised), nearly one quarter of which 
were computed by Japanese statisticians; iii) T. Torii, K. Takahashi 
and I. Dohi [1954], Statistical methods in biological and medical research; 
iv) T. Nakayama [1954], Design of field experiments; v) M. Masu- 
yama [1952-4], Popular lectures on experimental designs: Part I, Estima- 
tion, Part II, Testing hypotheses, Part III, Planning and designs; vi) T. 
Yamamoto [1954], Sampling method in fishery catch statistics; vii) K. 
Kinashi [1954], Statistical method in timber survey, and viii) K. Mat- 
sushita and C. Hayashi [1955], Sampling method in forestry and its 
applications. 

Since 1953, the Journal of Biostatistics has been issued quarterly 
by R. Kawakami. Mainly reports on biometrical surveys are contained. 
Also, in autumn 1953, Y. Kondo, M. Hatamura, Y. Tumura et al. 


*Publication fi ially aided by the International Union of Biological Sciences. 
; **Detailed bibliographic citations are listed under “References.” 
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started publishing a quarterly journal Research Memoirs of Agricultural 
Statistical Sociely whose contents are articles, reports and news on the 
topics of crop and agricultural economic sampling surveys, and the 
design of agricultural experiments. 


GENERAL THEORY 


The optimality of the orthogonal designs from the viewpoint of 
estimating parameters was proved by S. Moriguti [1954b]. In his 
article the necessary and sufficient conditions for Taguchi’s formulae 
were given. The sufficient conditions were first given by G. Taguchi 
in his ‘Notes on experimental designs.’ 

As an interpolation formula for Tang’s table of the error of the 
second kind in the analysis of variance, 8. Miyagi showed empirically 
that the point (¢, 8) lies approximately on a line of the normal prob- 
ability paper for fixed f, , f. anda. Later M. Masuyama [1951] deduced 
his empirical formula, combining Geary’s approximation of the distri- 
bution of the ratio of two normal variates and a normal approximation 
of non-central chi-square distribution. This is the justification of the 
use of the square root paper for the non-central F-distribution. The 
exact distribution of this Geary’s statistic was given by M. Masuyama 
[1952]. After Yamauchi’s approach S. Ura extended Lehmer’s table 
of the error of the second kind in the analysis of variance for a = 0.05 
and 8 = 0.10. He used the notation y which was defined by 

where ¢ denoted Tang’s original notation. This modified notation 
was first used by M. Masuyama and it is more convenient than the 
original notation. 

S. Moriguti [1954a] deduces confidence limits of a variance component 
by Welch’s approach, which give fairly reasonable values even when 
f, is small, provided that f, is sufficiently larger than f, . 

Instead of various tables of random sampling numbers ‘Random 
dice’ (a set of 3 icosahedrons) and ‘Randomer’ (a handy device for 
producing 3 random digits instantaneously) are fairly widely used. 
They are made in the Development Research Division, The Japanese 
Standards Association. 

K. Kunisawa et al. published a seven figure table of Kolmogorov’s 
distribution 
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for x = 0.250(0.001)2.299. G. Taguchi [1952] gave the minimum value 
of h of the Polya-Eggenberger distribution 


n=0 n! 
for h/d = 0.5(0.5)15.0, 20.0, 30.0, 60.0, ©, k = 1(1)25 and 
F(k; h, d) = 0.95, 0.99 


Studies on various uses of the square root paper were published by 
M. Masuyama [1954c] in English in a booklet which contained many 
exercises in biometry and some industrial applications in non-central 
t- and non-central F-distributions. It was compiled for use in the 
Western Pacific Regional Seminar on Vital and Health Statistics, 
W. H. O. 

It is not new to use punched cards in the analysis of observed data: 
However, their use to find a required design seems to be new. Its 
theory was first given by C. R. Rao in 1951, but his method requires 
the use of minimum functions, which is not elementary and is impractic- 
able if the size of experiment is fairly large. To remove these difficulties 
M. Masuyama [1955e] used a simple method to produce a deck of 
cards which has the same function as Rao’s, except in the case of Youden 
squares. Various decks of mechanical and hand sort cards were pre- 
pared for almost all cases which are needed in laboratory or in factory. 


DESIGN OF AGRICULTURAL EXPERIMENTS 


Randomized blocks and split-plot design are common in various 
kinds of experiments; e.g. variety, seeding and fertilizer tests, or trials 
for the control of noxious insects or plant-diseases, which are performed 
at the agricultural experiment stations in each district or prefecture. 
Simple lattices, triple lattices and cubic lattices are occasionally used in 
variety yield tests of paddy-rice, wheat, soybean and maize. Lately, 
several groups of experiments conducted in cultivator’s fields have 
been summarized statistically, and some variety X place interactions 
were evaluated. 

In the Bulletin of the National Institute of Agricultural Sciences, 
sertes A, M. Hatamura, T. Okuno and T. Sasaki published a lengthy 
article ‘On the design and analysis of field experiments’, in Part I of 


which the necessary techniques for the statistical analysis of an experi- — 


ment are introduced. In Part II, the randomized block design. is 
selected, the relation between the random assignment of treatments in 
each block and the assumptions underlying the normal regression theory 
is discussed, and the stochastic models and discussions of B. L. Welch, 
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E. J. G. Pitman, A. Wald and O. Kempthorne are generalized to some 
extent. In Part III, the most general theory for the incomplete block 
designs is deduced by the use of particular symbols. Analyses of the 
balanced lattice, the simple lattice, the triple lattice, the cubic lattice 
and the rectangular lattice are obtained as special cases. 

In the Research Memoirs of the Agricultural Statistical Society, 
T. Okuno and T. Sasaki reported ‘Studies on the stochastic models in 
analysis of covariance’. In this report, it is assumed that the inde- 
pendent variable z as well as the dependent variable is subject to 
treatment and block effects, and that the models for all variates are 
the same as in the above article. Under these assumptions exact tests 
of significance of treatment effects are introduced and discussed in 
detail. 


BIOASSAY 


Modern techniques of bioassay were intensively used by Y. Ito and 
his collaborators. A monograph, “Bioassay and stochastics” was 
published in 1953 which was characterized by balanced introduction of 
biological and mathematical aspects of recent development of bioassay 
techniques. An empirical method of point-wise estimation of synergistic 
action was discussed by M. Masuyama [1955b,c] and his method was 
applied to study the joint action of two antibiotics, whose dose-response 
curves could not be represented by the logarithmico-normal distribution 
functions with the same standard deviation. 

On the other hand, in the field of insecticides, various studies were 
carried out by Kono, Kono & Utida, Nagasawa, Ohsawa & Nagasawa, 
Ohsawa and others. 

In 1954, Penicillium islandicum and other toxic fungi were found in 
imported rice, so that a Toxicological Research Group and Sampling 
Research Group were organized. As a member of the last group M. 
Masuyama [1955] wrote an article in which he suggested the use of 
composite samples in two cases; i) to screen the lots which contain 
toxic fungi, assuming the total number of lots is finite and the fraction 
of defective lots is fairly small, and ii) to estimate the percentage of 
infested grain by the method of “multi-grain culture per one Petri dish”’. 


POPULATION GENETICS 


After the second War, concepts of population genetics were intro- 
duced. M. Kimura, K. Sakai and E. Nakamori reported theoretical 
studies on the segregation of genes and the degree of homozygosis in 
continued self-fertilization. M. Nei reported mathematical studies on 
the hybrid behavior of partially allogamous plants. 

K. Sakai [1951] and K. Sakai et al. [1951] discussed the importance of 
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bulk-method in breeding. K. Sakai et al. [1953b] studied the decrease of 
hybrid vigor through the successive generations. Further studies on 
change of heritability in autogamous plants by K. Sakai [1954], estima- 
tion of combining ability by N. Nakamura [1953] and heritability in 
eggplant by K. Gotoh [1954] should be mentioned. Heritabilities in 
maize and in sheep were calculated by N. Nakamura et al. [1953] and 
by K. Matsumoto et al. [1954] respectively. Brief reports concerning 
heritability were published in swine by M. Taketomi et al. [1953, 1954], 
in White Leghorn fowl by Y. Onishi [1953], in tobacco by S. Iyama 
[1953], and in soybeans by Y. Yoshino et al. [1953]. 

Results of bulk-method breeding were reported in rice by T. Naga- 
matsu et al. [1953] and in wheat by S. Sekizuka [1953]. Meanwhile, 
T. Hoshino [1953] and K. Sakai et al. [1953a] called attention to the 
interplant competition in rice, as did T. Yamada [1953, 1954] and 
T. Yamada et al. [1953a,b] in the case of barley, soybeans, red clover 
and wheat. In addition, K. Sakai [1953] estimated the magnitude of 
the competitional variance in mixed plant populations of wheat varieties. 
Thus, theoretical studies on bulk-method breeding are going to be 


reexamined on the basis of the interplant competition. 


ECOLOGY 


Research of Population Ecology, vol. 1 (1952) and vol. 2 (1953) 
contained many studies on the population ecology of various insects by 
the group of entomologists in Kyoto University. Many probabilistic 
investigations on inheritance have been performed by Y. Komatu 
and his collaborator since 1951. 

For sampling survey of vegetation, optimal size and number of 
square grids (sampling units) were studied by M. Numata [1949]. 
The number of square grids is closely related to the homogeneity of 
plant distribution. In this article, he defined a coefficient of homogeneity 
(h) under the assumption of normal distribution. , 

There are three types of plant distributions; i.e. random, regular and 
aggregate distributions. The homogeneity of plant distribution means 
the non-aggregate distribution. By M. Numata [1950b], it is shown 
that such a homogeneity consists of individual homogeneity (h) and 
a communal one (floristic and vegetational). 

Several topics in the study of vegetation, viz. typological method, 
sampling method, plant distribution, and the size and number of square 
grids etc. were discussed by M. Numata [1950c]. 

M. Numata [1954b] defines the structure of a plant community as 
a wide concept, including quantitative composition and spatial distri- 
bution of species and life forms, dispersion of biological characters 


- measured phytosociologically, and their dynamic variation. Among 


) 
> 
; 
3 
1 
f 
n 
re 
a, 
in 
1g 
of 
of 
” 
cal 
i 
in 
on 
of 


454 BIOMETRICS, DECEMBER 1956 


these, the dispersive structure is the term widely used to denote plant 
distribution, and constitutes a large part of the quantitative structure 
treated from the view-point of homograde statistics. 

There are several methods of stratification of vegetation. The 
stratification by dominant species does not, in general, coincide with 
that by the physiognomy (general outward appearance) of vegetation. 
The latter i.e. ‘ecological stratification’ seems to be justified statistically 
(M. Numata and H. Nobuhara [1952]). 

Motomura in 1932 introduced ‘the law of geometrical progression 
of the population density’. The frequencies of individuals of each 
species in a sampled area in certain animal populations are approxi- 
mately arranged in a geometrical progression. M. Numata, H. Nobu- 
hara and K. Suzuki [1953] tried to give its theoretical basis and its 
practical interpretation in plant populations. 

The structure of a plankton population was studied statistically by 
T. Ito and M. Numata [1954], by means of an artificial population which 
was made of coloured sesame in a 5% solution of NaCl. 

Shinozaki and Urata in 1953 showed the close relationship among 
Motomura’s law, Corbet’s law of harmonic series, Fisher’s law of logarith- 
mic series, and Preston’s logarithmic curve. The observations in a 
beech forest at Mt. Daisan, Tottori Prefecture, Japan showed that 
there were three types of populations; i.e. the GP type for the geometrical 
series, non-GP type or S type which was derived from Shinozaki and 
Urata’s homogeneity concept, and intermediate type. The frequency 
of species in that beech forest was represented by the geometrical 
series only when the size of sample was small or the stratification 
according to life forms was introduced, and otherwise the S type popu- 
lation (H. Nobuhara and M. Numata [1954]). 

The change of distribution function in the population depending 
upon the size of grid was studied by M. Numata [1950d]. The distribu- 
tion function seems to be Poissonian for smaller grids, and binomial for 
larger grids. 

Seasonal variation of the type of weed communities in rice-fields, 
farm lands, forest floors, and a wasteland was studied. Vegetation 
types were expressed quantitatively by means of the coefficients of 
biological types. The forms of seasonal variations of vegetation types 
represent biologically habitat conditions. These variations were 
analyzed by M. Numata [1953], from the viewpoint of time series. 


FORESTRY 


Plane integral geometry was applied to estimate the basal area and 
the perimeter of timber (and the number of trees in a stand if necessary) 
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by M. Masuyama [1953a,b]. The same method is applicable to esti- 
mate the total area under certain crops. Under certain conditions 
integral geometrical method was proved to be more efficient than the 
ordinary areal sampling method. Instead of the fairly expensive 
Pendelrelaskop or Spiegelrelaskop, M. Senda and K. Maezawa [1955] 
made a simple instrument which had an angle-gauge with a slit of 
constant width with which the bias due to the slope of a stand was 
automatically removed. 


FISHERIES 


It was after World War II that statistical methods were intro- 
duced into surveys of fisheries, and that a considerable development in 
sampling method appeared. Since 1951, the fishery catch statistics in 
Japan have been prepared on the basis of sampling methods used on a 
nation-wide scale. Details have been described in Sampling method 
in fishery catch statistics [1954] by T. Yamamoto, and a general descrip- 
tion was issued as an Occasional Report by the regional office of Food and 
Agriculture Organization, Bangkok, in 1955. In these surveys, various 
sampling methods have been applied in accordance with actual siidu- 
tions in commercial fisheries. Double sampling ratio estimate using 
the number of trips as an auxiliary variable may save labour and expense 
in enumeration. 

Meanwhile, biological surveys of commercial landings have been 
made, usually, by multistage systematic sampling (for instance, boat- 
container-individual fish). T. Doi [1948, 1949a,b], T. Doi et al., [1951], 
Y. Fukuda [1953], S. Kurita [1948], S. Tanaka [1953a,b,c, 1954a,b] and 
I. Yamanaka [1953, 1954a,b] each on different occasions dealt with 
methods of sampling, estimation and analysis, particularly for estima- 
tion of survival rate from age-composition. A study made by 8. Tanaka 
[1955a,b] concerned the method for estimating the total abundance of 
fish eggs spawned. Under the guidance of T. Kitagawa, T. Yokota [1953] 
attempted to count the total number of fish shoals from the images of 
a fish finder. T. Doi [1950a,b, 1955a,b] and T. Kawakami [1952] exam- 
ined methods for prediction of future catch, analysing the auto-correla- 
tion of catch and the structure of the fish population (age-composition, 
mortality rate etc.). T. Yoshihara (1951, 1952] studied the growth 
of carp in a pond, fitting a logistic curve. The distribution of fish on 
the tuna long-line and salmon gill-net was discussed by T. Yoshihara 
11954] and H. Maeda [1953] respectively. 

We cannot hope that this summary covers all biometrical projects 
in Japan during the last 5 years. Unintentional omissions will be 
reported at the next opportunity. 
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CONTROL OF ERRORS IN SURVEYS 


Morris H. HANSEN AND JOSEPH STEINBERG 
U. S. Bureau of the Census, Washington, D. C., U.S.A. 


1. INTRODUCTION 


In any process of data collection there are potential sources of error 
at every stage in the survey procedure. Errors may arise in defining 
the problem, in defining the universe to be studied, and in defining the 
concepts or establishing the measurement procedures (such as the 
question wording). They may originate in the sampling, i.e., in the 
specification of the units to be included and in the coverage of these 
units. In surveys conducted by field interview, they may arise from 
the complicated structure of the interview situation, in some measure 
stemming from the understanding, interest, motivation, knowledge, 
and skill of the interviewer and the respondent. Procedures for handling 
the data, such as coding and editing and tabulation, may also lead to 
errors in surveys. 

It is the purpose of good survey design to control the errors arising 
from these sources to an economic level. This level is reached when an 
increase in expenditures will not produce a worthwhile decrease in the 
risk of making wrong decisions from the survey results. Although we 
have not found a way to determine objectively this optimum level of 
control in connection with general purpose surveys in which a number 
of different statistics are produced and used for many different purposes, 
we do give the problems continuing attention. The problem of control 
is one of particular concern because errors arising from the various 
possible sources, especially those arising in the field collection of the 
data, sometimes are much larger than is commonly recognized. Often 
they can be controlled satisfactorily, if at all, only by explicit steps 
taken for such control. It is easy, on the other hand, after one becomes 
acquainted with the frequency and magnitude of individual errors, to 
be unduly pessimistic about the value of census or survey results. With 
reasonable procedures for control, and with the tendency for some of the 
types of errors to be more or less compensating, the net effects may be 
small enough so that the statistics will serve adequately many different 
purposes. In this setting, it becomes important to establish procedures 


‘for evaluating and for controlling the errors that may arise from various 


sources. We shall confine our discussions primarily to the errors arising 
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in the collection process, such as those involved in attempting to comply 
with the sampling specifications and with other procedures in collecting 
the data. 

It should be emphasized at the beginning that it is one thing to 
specify a good survey design, and quite another to set in motion ad- 
ministrative procedures and controls which insure that the operations 
are carried through substantially as specified. It is to this problem 
that we wish to devote particular attention. 


2. SOME GENERAL APPROACHES TO 
CONTROLLING ERRORS IN SURVEYS 


Many types of controls have evolved from experience and are com- 
monly accepted as effective. The methods ordinarily used to control 
errors—careful selection of personnel, thorough training, and review 
and correction of work—are not always used effectively in the field 
collection phase. Often the review operations involve only the ex- 
amination of returned questionnaires for more or less obvious in- 
consistencies or failures to follow instructions, and perhaps a limited 
amount of reinterviewing or observation of interviewers at work. 

When we are confronted with evidence of unsatisfactory survey 
results, we tend to depend more heavily on such activities as more 
intensive training, or more supervision. Such methods may be effective. 
However, they should be evaluated in an effort to determine whether 
they yield the desired results and whether the additional expenditures 
‘entailed are commensurate with the added accuracy obtained. It is 
important to recognize that some of these methods of control in censuses 
and surveys may be far from sufficient and that evaluation of each 
phase is desirable. It is unfortunate that comparatively few attempts 
have been made to evaluate such methods objectively. While there 
have been some efforts along these lines in the U. S. Bureau of the 
Census, the problems are difficult and the results have not been satis- 
factory. We believe that much additional work is needed in the objec- 
tive evaluation of such techniques. 


Controls in the U. S. Current Population Survey. 


We shall consider the problem of control primarily by describing 
and discussing some of the methods followed as well as some of the 
results of applying these methods to our Current Population Survey. 
(This is the monthly population sample survey taken by the U. S. 
Bureau of the Census for estimating labor force characteristics, and, 
from time to time, information on various other topics.) In this survey, 
field interviews are taken from a sample of about 21,000 households 
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each month.* The interviewing is done within about a week in each 
month, and the results are compiled and published within about two 
weeks after the end of the enumeration period. Approximately 350 
part-time interviewers work under the direction of about 60 full-time 


‘Supervisors and assistant supervisors in 33 field offices.** The sample 


is spread over 230 primary sampling units each consisting of one or two 
(or occasionally more) adjoining counties. One or a few interviewers 
work in each primary sampling unit on a part-time basis. Most of 
them are housewives, providing a labor supply of relatively high educa- 
tional status and ability, interested and effective in field interviewing. 


General procedures for selection, training, and supervision in the Current 
Population Survey. 


Of course, we attempt initially to select enumerators who are 
qualified to do the kind of work that is involved in our surveys. As an 
aid in selection a series of tests have been designed to determine whether 
enumerators can understand the concepts, read maps, and follow the 
ordinary instructions which they are required to observe in conducting 
the surveys. 

After enumerators have been selected they undergo a training period. 
At the present time new interviewers receive three days of instruction 
during their first month of employment. The initial training consists 
of two days in the area in which the interviewer is to work; it covers 
general indoctrination and lectures on the two major operations—listing 
and interviewing—and it includes supervised field praqtice by the 
enumerator. Other matters receiving considerable attention are mock 
interviews, administrative matters, and work assignments. This first 
two-day period is divided into a number of training sessions of varying 
length. A training guide is provided to the supervisor indicating how 
each period is to be utilized. 

After the initial training the interviewer is given written assign- 


‘ments and review exercises to be completed at home and mailed to the 


supervisor. The third day of training is given on the first day of enumera- 
tion week of the first month in which the interviewer works. On this 
day the supervisor observes actual interviews, checks some of the listing 
work, reinterviews some of the households, and then continues such 
follow-up training as he believes desirable. 


*Starting in May 1956, the publication of results has been based on a larger sample, now com- 
prising about 35,000 interviewed households in 330 primary sampling units. A brief description 
of the expansion appears in The American Statistician, April 1956, pp. 5 and 6. 

**T his staff also is responsible for the field collection of information for the monthly retail trade 
survey and also works from time to time on other census surveys. 
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In the second month, the interviewer receives some home study © 
materials and exercises, for completion and return for review and 
correction. The supervisor then accompanies the interviewer for about 
a day during enumeration week and observes his work, reinterviews 
some of the households, and discusses with the interviewer problems 
that he observes. In the third month home study materials are again 
used, but usually there is no supervisory visit unless the caliber of the 
interviewer’s previous work has indicated a need for special attention. 

Regular training of experienced interviewers follows this pattern: 
About four times a year, group training sessions are held. In these the 
supervisors review the problems arising in the enumeration and discuss 
specific new types of questions which the interviewers are to use for 
supplementary information. They stress such things as approaches in 
interviewing, completeness of coverage, preparation of forms, and other 
pertinent matters. In other months training is by mail and is adjusted 
to cover any supplemental inquiries for that particular month. Home 
study exercises are provided to the interviewer who completes and 
sends them to the supervisor for review. (Interviewers are paid for a 
stated number of hours to be devoted to these home study exercises. 
They also receive pay for attending group training sessions.) In addition 
to this, supervisors schedule about two visits a year to observe the work 
of trained interviewers and to see how they approach their work and 
how they conduct themselves in obtaining the information. The 
types of controls possible through these training and observation 
periods are usually fairly subjective, since they depend so much upon 
the ability of the supervisor to detect inadequacies and to correct them. 
The group training sessions are designed to stimulate interest, to moti- 
vate the interviewer to do a good job, and to show him the proper 
procedure for carrying out his assignments. 


3. QUALITY CONTROL ON FIELD OPERATIONS OF THE 
CURRENT POPULATION SURVEY .- 


In an attempt to control the quality of the results of field interviewing 
in the Current Population Survey we have instituted a formal quality 
control and quality check procedure. This has two objectives. First, 
it is a device to control the quality of work of individual interviewers; 
and second, it provides a check on the overall quality of coverage as 
well as content of the survey. 

The system of quality control has been designed to identify those 
interviewers whose work is beyond acceptable limits of performance, 
so that they may be singled out for retraining or other administrative 
action. Our basic policy is to concentrate the supervisory work on 
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those interviewers whose work is poorest—and to give less attention to 
interviewers whose work is satisfactory. With a relatively stable 
interviewer force, this approach would seem to yield maximum results 
for a given amount of supervisory effort and cost. The importance 
attached to the issue of error control can perhaps best be summarized 
by noting that of every $100 spent on enumeration, about $11 is spent 
on checking the quality of the information collected and retraining 
interviewers whose work is unacceptable; an additional $18 is spent on 
regular training and observation. 

Our quality control operation is carried through by supervisory 
personnel and involves redoing the listing and reinterviewing the house- 
holds in a subsample of the areas covered by the survey. There is a 
presumption that the supervisors can list and interview better than the 
interviewers and that their work can serve as a norm against which to 
judge the quality of the work of the individual interviewers. In any 
one month the subsample consists of about one-fourth of the inter- 
viewers, and about one-third of the work of each of these interviewers is 
covered. To provide an objective method of choosing the work to be 
reviewed, a subsample of the 230 areas is selected (taking into account 
the relative workloads); and within this selected subsample of areas a 
subsample of the segments is predesignated for check. In determining 
the segments to be selected for this check, clusters of segments are 
arranged so that any one cluster tends to be essentially the work of a 
single interviewer and is of sufficient size to enable detection of poor 
work. 

When the original interviewer transmits his completed schedules 
to the district office, a clerk in the office transcribes the information 
from each schedule to a reconciliation form. These reconciliation forms 
are then placed in an envelope which is put at the disposal of the person 
conducting the check for use only for reconciliation after the reinterview 
has been completed. 

The supervisor recanvasses the segment and checks the living 
quarters he finds within the segment against the lists originally prepared. 
Errors that are found in coverage of households are recorded. Within 
the sample households he is instructed to make a check of coverage 
of persons and to conduct an independent reinterview using a schedule 
identical with the one used originally. Wherever possible, the reinter- 
view is conducted with the person regarding whom the information is 
being obtained; second choice is the original respondent, and next any 
other acceptable respondent. Then the supervisor compares the 
results of the reinterview with the information on the reconciliation form. 
Differences are called to the attention of the respondent and an attempt 
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is made to determine the correct answer. If the reinterview answer is 
confirmed the supervisor tries to find the reasons for the discrepancy. 
In addition, the supervisor calls all remaining differences to the attention 
of the interviewer. The reconciliation reduces the discrepancies origin- 
ally found in the content reinterview by about 20 percent. The reinter- 
viewing is done, on the average, about a week following the original 
interviewing. 

This program gives the supervisory staff a measure of quality of 
the work of each interviewer. In keeping with the objective of quality 
control (which is to keep the overall rate of error within some pre- 
determined acceptance limits) the differences found in the work of each 
interviewer are summarized and a determination is made of the ade- 
quacy of his work with respect to (1) coverage of units, (2) coverage 
of persons within households and (3) the subject matter content. At 
present the control for each of these is set so that 95 percent of the 
enumerators with gross error rates of 5 percent will be considered 
acceptable. 

Those interviewers whose work is unacceptable on the basis of any 
of these measures of quality are supposed to be retrained for one day 
prior to or on the first day of the next enumeration period. In this 
training session the supervisor discusses with the interviewer the matter 
of his problems and shows how to overcome his difficulties. A supple- 
mentary check is to be made the following month to determine whether 
the interviewer’s work is acceptable. Those interviewers whose work 
is not under control after such additional instruction are to be replaced 
or given further training. 

Let us examine some of the results of this check operation to date. 
First let us consider the quality control aspects in terms of the work of 
individual interviewers. Table 1 gives a distribution of monthly gross 
“error” rates of interviewers for a 12-month period May 1954 through 
April 1955. As can be seen, a relatively small proportion of the inter- 
viewers made a large proportion of the errors in coverage. On the 
other hand, discrepancies were not so concentrated with respect to 
subject matter, although only a small proportion of the interviewers 
are deemed unacceptable. 

Table 2 shows in summary the results of some efforts to classify 
errors in subject matter by cause. These classifications are assigned by 
the check interviewer at the time of the reconciliation, and must be © 
regarded as more or less superficial. They may, nevertheless, provide 
some guide to the sources of differences. On the basis of this analysis, 
about one-fourth of the differences between the original survey and 
the reinterview arose when a different respondent was reinterviewed. 
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TABLE 1 


DistTrRisuTION OF INTERVIEWERS’ Gross ‘ERROR’ RATES AND OF GROSS 
DiFFERENCES REePporTED, CPS Montuiy Recueck, May 1954 


Aprit 1955 


Frequency in monthly 


assignments checked Gross differences reported 
Gross error rate 
(percent) Cumulative Cumulative 
Actual percentage Number percentage 
Coverage: 
0 558 100 _ 
0.1- 0.9 1 15 3 100 
1.0- 1.9 7 15 7 99 
2.0- 2.9 7 14 7 97 
3.0—- 3.9 6 13 7 95 
4.0- 4.9 1l 12 12 94 
5.0- 5.9 12 11 14 90 
6.0— 6.9 5 9 12 87 
7.0- 7.9 11 8 13 83 
8.0- 8.9 & 6 25 80 
9.0- 9.9 4 5 8 73 
10.0-14.9 10 5 25 71 
15.0-24.9 8 3 53 65 
25.0 and over 12 2 189 50 
Total 660 375 
Content: 
0 281 100 
0.1- 0.9 66 58 98 100 
1.0- 1.9 83 48 247 96 
2.0- 2.9 71 35 352 86 
3.0- 3.9 36 24 245 73 
4.0—- 4.9. 28 19 241 63 
5.0- 5.9 27 15 247 53 
6.0- 6.9 21 11 237 44 
7.0- 7.9 13 7 148 34 
8.0- 8.9 8 5 99 29 
9.0- 9.9 5 4 86 25 
10.0-14.9 14 3 227 21 
15.0-24.9 5 1 172 12 
25.0 and over 4 1 142 6 
Total. 662 2,541 


Note that the workload per enumerator varies and consequently the base for the gross error 


rate is not constant. 
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Approximately a third of the differences arose because cither the respond- 
ent misinterpreted the question or the interviewer misinterpreted the 
answer. 

Relatively few differences seem to arise because the questions are 
not asked properly or because the interviewer misrecorded the answer. 
About one-third apparently resulted even though the respondent 
understood the question: in some cases it appeared that he was thinking 
of a different survey week, forgot part of the correct answer, or erred 
in his calculations. Thus, one-half or more of all differences seem to 
arise because the respondent was at fault, and perhaps only about one- 
fourth of the differences can be attributed to errors by the interviewer. 
Consequently, the gross differences do not provide a sensitive indicator 
of the quality of an interviewer’s work, and their use as a measure of 
the performance of interviewers may lead to inefficient use of resources. 
This analysis raises a question whether the quality measure of inter- 
viewer performance should not be based on differences classified as 
interviewer errors in the check interview. Then perhaps the remaining 
differences, when they are large, can be regarded as indicating the 
existence of difficult interview situations, and additional training of 
interviewers can be focused on insuring that the respondent is led to 
give the proper answer or to understand what is wanted from him. 


TABLE 2 
DISTRIBUTION OF APPARENT CAUSES OF DIFFERENCE IN CLASSIFICATION 
Berween CPS OrrernaL AND INTERVIEW, FOR IDENTICAL 
Inpivipuats, May 1954 Fresruary 1955 


Apparent cause of difference Percent (of 257) 
Different respondent 27.3 
Due to enumerator 22.57 

Questions not asked properly 3.1 
Enumerator misinterpreted answer 47.2 
Enumerator misrecorded answer 2.3 
Due to respondent 47.5 
Respondent misinterpreted question 12.8 
Respondent understood question 
but reported incorrectly 34.7 
Other 2.7 
Total 
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We have hesitated to take this approach, and have not done so yet, 
because of the dubious nature of the classification of the cause of the 
differences. 

These data are only fragmentary, and as we continue to study 
further results we hope to learn more about the nature of the problem. 
It is interesting to note that in about half the March 1955 cases where 
the interviewers’ work was unacceptable on the basis of the measure 
of discrepancies from the check, the supervisor did not carry through 
the retraining and reported that he did not regard it as desirable or 
necessary. 

The results of the reinterviews are also summarized to provide an 
overall measure, of the quality of the interviewing. Table 3 shows the 


TABLE 3 


MonrTHiy AVERAGE RESULTS FOR IDENTICAL PERSONS INCLUDED IN ORIGINAL 
AND REINTERVIEW SAMPLE, May 1954 THrouGcH Aprit 1955 


Monthly average num-| Average net 
ber of persons reported|difference, (2)—(1)| Percent 
Employment status identically 
In original} In check | Number} Percent | reported 
interview | interview 
(1) (2) (3) (4) (5) 
Labor Force 1,185 1,201 16 1.3 97.7 
Employed 1,123 1,137 14 1.2 98.1 
Agriculture 127 131 4 3.0 94.7 
Nonagriculture 996 1,006 10 1.0 98.5 
Full-time (worked 35 
hours or more) 729 726 - 3 — 4 97.5 
Part-time (worked 
less than 35 hours) 215 226 1l 4.9 87.9 
With a job but not at 
work 52 54 2 3.7 85.6 
Unemployed 62 64 2 3.1 87.8 
Not in Labor Force 959 943 —16 —-1.7 98.9 


results of the original interviews and the reinterviews over the 12-month 
period May 1954 through April 1955. It also shows the proportion of 
persons identified in a particular class in the check who are identified 
in that same class in the original interview. This table relates only to 
persons covered by both the original and reinterview and thus excludes 
about 15 percent of the eligible persons (those not covered because of 
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listing errors, non-interviews in either the original survey or the check, 
and persons who moved in the interval between the two interviews). 
Only the difference for persons working part-time and that for aggre- 
gates affected by this class appear to be statistically significant. The 
comparatively small net differences reflect the fact that many of the 


gross differences tend to be compensating. 


3 
2 EMPLOYED IN AGRICULTURE WORKING - NON AG, - 


o+—+ + +—+ —+—+ + 
MITASONDIFMA 


1954 1955 1954 1955 
.6 
WORKING NON AG, PART TIME WITH A JOB = NON AG, = NOT AT WORK 
9 


MIJTASONDIFMA 
~ 


1954 1955 
7 
UNEMPLOYED 
el 


INDEX OF SHIFT 


5 i] + + + 
A MIJTASONDIFMA 
3 1954 1955 
| NOT IN LABOR FORCE 
elt el 
AL a DI FMA 
1954 1955 1954 1955 


FIGURE 1. INDEX OF SHIFT OF LABOR FORCE STATUS ITEMS, CPS RECHECK, 
AFTER RECONCILIATION, MAY 1954 THROUGH APRIL 1955 


Fig. 1 examines the reinterview results in a somewhat different form. 
The index of shift is the ratio of the gross differences in a given category 
to the number in the category as determined by the reinterview. Thus 
a high index of shift indicates great instability in the classification of 
individuals into the category. The groups with marginal attachments 
to the labor force such as the part-time employed, those with a job 
but not at work and the unemployed have the greater instability. 
One of the things which we have been speculating about is the degree 


to which the index of shift especially in the case of unemployment, | 


may reflect the variability in classification over time caused by the high 
seasonal incidence of persons with relatively small attachment to the 
labor force. 

We have also examined some of the net differences to determine 
whether on an overall basis the results of the Current Population Survey 
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can be considered to be under control. Figs. 2a and 2b show net differ- 
ences between the original interviews and the reinterviews. (Points 
above the horizontal axes denote a net relative understatement in the 
original Current Population Survey enumeration; points below denote 
a net relative over-statement.) For the most part, the points on the 
chart seem to indicate that the various statistics are under control, 
although for the unemployed group (Fig. 2b, right) one may speculate 
whether the results in July and November 1954 may not suggest some 
problems for those months. 

Experience to date is too limited to give any assurance that this 
system of quality control will adequately signal trouble when it arises. 
The reinterview and reconciliation procedures that we have been 
using do not directly enable us to distinguish those errors or differences 
caused by the respondent from those caused by the interviewer. While 
our efforts have been designed to keep our net errors small, the quality 
control features of this program have been aimed toward reducing the 
gross differences. However, it appears necessary for us to direct more 
attention to the kinds of errors contributed by respondents. Also, 
intensive work on the evaluation of coverage has not been sufficient to 
explain consistent differences of about 2 to 4 percent in the total popu- 
lation estimated from the Current Population Survey and that esti- 
mated on the basis of the last census. We are not yet satisfied that our 
intensive measurement methods are adequate. 


4. FUTURE WORK 


The check operation discussed represents our major effort so far 
to ascertain differences objectively, and to control them by determining 


‘the sources of contribution to error and by taking steps to deal with the 


problems identified. We have been experimenting with more intensive 
types of reinterview schedules in an effort to determine whether we can 
more clearly ascertain reasons for differences. Fragmentary results 
now suggest that these more intensive interviews do lead to improved 
understanding, although no substantial changes in error or gross 
difference rates have been observed from their use in our limited ex- 
perience. One observation is that the use of a check list which more 
precisely defines the concepts to the respondent may help in carrying 
out both an initial interview and reinterview. We are currently con- . 
sidering the possibility of using such a check list in the original inter- 
views. 

A great deal of research still remains to be done. We are conducting 
additional studies to learn more about sources of error in our surveys 
and about the effectiveness of our supervisory activities. Much still 
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needs to.be learned as to the relative advantages or disadvantages of 
the different steps taken, which are based on intuitive grounds and for 
which, to date, little quantitative information is available. For example, 
what should be the methods and the periodicity for group training? 
What should be the periodicity for observations? What other type of 
measures do we need to institute? What incentives can we provide so 
that the persons working at different stages of a survey can be properly 
motivated and so that the types and number of errors can be controlled 
to a desired level? And last but not least, how can we insure that the 
respondent can be “trained” to give the information which we need in 
order to provide the data required? ; 

Survey design has come a long way. Sample design has been studied 
so that control of its contribution to the mean square error of survey 
estimates is well understood. We have yet a lot to learn about control 
of the other contributions to the mean square error. This is a report 
of progress on approaches to such control in one continuing survey, 
and on some of the problems and difficulties still to be resolved. 
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THE STUDY OF THE PHYSIOLOGICAL EFFECTS OF HOT 
CLIMATES 


J. O. 


Statistical Research Unit of the Medical Research Council, London School of Hygiene 
and Tropical Medicine, England. 


1. THE PROBLEM—EFFECTIVE TEMPERATURE 


“One of the physiological requirements for health is the maintenance of a 
practically constant body temperature. In man the average mouth temperature is 
about 98.4° F. and the temperature of the deep tissues is about 99° F. The body 
temperature is controlled by physiological mechanisms which regulate the rate of 
heat loss from the body surface. It is through the operation of these mechanisms 
that the temperature of the body remains nearly constant even though the warmth 
of the environment varies over a considerable range, and over long periods measure- 
ments of heat production and heat loss correspond very closely. But in hot environ- 
ments the body may not be able to get rid of heat as fast as it produces it, then the 
body temperature will rise, and if the exposure is prolonged, serious results may 
ensue.” 

“The human body loses heat by three paths, radiation, convection and evapora- 
tion. The environmental factors which affect the rate of heat loss are the temperature, 
humidity and rate of movement of the air, and the radiation from the surroundings, 
but the rate of loss is largely governed by the physiological mechanisms which serve 
the body as thermostatic controls.” Bedford [1946]. 


Thus in comparing two thermal environments we have to take 
account of temperature, humidity, rate of air movement and radiation. 
The advantage, indeed the practical need, of having a single index of 
thermal environment became apparent a long time ago. This need 
was to a considerable extent met by the index known as “effective 
temperature’”’. Originally, effective temperature was designed to take 
account of the temperature, humidity and rate of movement of the air 
and was a measure of subjective feelings of comfort. It was based on 
experiments carried out at the Research Laboratory of the United 
States Bureau of Mines, Pittsburgh, by Houghten and Yagloglou [1923] 
and their colleagues. 


“Trained observers passing between two controlled climate rooms identified 
widely varying combinations of dry and wet bulb temperatures which gave rise to 
the same subjective sensations of comfort. The conditions in one room were kept 
constant and in the other the temperatures were raised or lowered until the tempera- 
ture was identufied at which the sensations in the two rooms corresponded accurately. 
The walls of the climatic rooms were at substantially the same temperature as that 
of the air. “Equal comfort lines” for “still air” fitting the corresponding wet and 
dry bulb temperatures in the two rooms were superimposed on a standard psycho- 
metric chart. The effects of higher air velocities were then investigated, and align- 
ment charts were constructed for assessing “effective temperature” which related 
conditions under examination to the corresponding temperature of still and saturated 
air at which the same effect on thermal/comfort would be experienced.”’ (Ellis [1953}). 
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Effective temperature is therefore the temperature of still air saturated 
with water vapour in which a sensation of warmth, equivalent to that 
experienced by those in a particular environment under investigation, ts 
on the average reported by the subjects in a long series of tests. Since 
clothing reduces the effect of air movement, a “normal scale’ was 
constructed for persons wearing ordinary indoor clothes and a ‘“‘basic 
scale” for persons stripped to the waist. The limiting conditions to 
which men might be exposed without sustaining ill-effects were said 
to be an effective temperature of 90° F. if they were at rest and 80° F. 
if they were engaged in heavy work. 

The original scale of effective temperature took no account of 
radiation. Bedford pointed out that it can be corrected approximately 
for the effects of this factor if the globe thermometer temperature is 
used instead of the dry bulb temperature in calculating the effective 
temperature. The globe thermometer is an ordinary thermometer 
with its bulb at the centre of a hollow 6-in. metal sphere coated with 
matt black paint. If the walls and other surfaces which surround the 
globe are warmer than the air, the temperature recorded by the ther- 
mometer inside the globe is above air temperature. Consequently, the 
readings of the globe thermometer make some allowance for radiation 
from the surroundings. 

The effective temperature scale was devised well before the outbreak 
of the second world-war. Problems of excessively hot environments 
aroused a great deal of attention during that war because these con- 
ditions were experienced in warships serving in tropical waters. 


2. PREDICTED FOUR-HOUR SWEAT RATE 


In 1944 the British Medical Research Council was asked by the 
British Admiralty to investigate the climatic effects of the working 
conditions prevailing between decks in tropical waters. This led to 
the establishment of the Tropical Research Unit at Singapore in Jan- 
uary 1949. 

Meanwhile a team of civilian workers and naval medical officers at 
the National Hospital, London, under the leadership of Dr. McArdle 
observed the physiological effects of heat on men at rest or working at 
fairly high rates stepping on and off 12-inch stools. Observations were 
made on young naval ratings after they had been acclimatised “‘arti- 
ficially’’ to work at high temperatures by daily work under hot condi- 
tions for two or three weeks. These young men were then able to 
work satisfactorily at tasks involving energy expenditures similar to 
those of gun crews during a bombardment and whilst wearing anti-flash 
protective clothing, when the wet bulb temperature was as high as 
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90° F. The findings in general supported Haldane’s [1905] view that 
the level of the wet bulb temperature was a useful guide to the thermal 
tolerability of an atmosphere, and showed that the upper permissible 
levels of wet bulb temperature could be raised by 2° to 3° F., if men 
worked stripped to the waist in shorts instead of wearing overalls, or 
alternatively if the air movement was raised from 20 ft/min. (still air) 
to 200 or 300 ft/min. (for men wearing shorts or overalls respectively) 
under hot humid conditions. On the other hand, they found that 
little was to be gained by increasing the air movement above 300 ft/min. 
under the conditions of these experiments. It was accepted that these 
limiting conditions would be lowered if the temperature of the walls or 
surroundings exceeded the air temperature, if the men worked more 
vigorously, if they were already fatigued or untrained, or if the subjects 
were old men or less well acclimatised to work at high temperatures. 

The standard effective temperature charts apply only to. men 
engaged in light or sedentary activities, and the workers at the National 
Hospital found that they were inaccurate for predicting the relative 
physiological stress experienced by men working under relatively warm 
conditions. Rises of dry-bulb temperature and falls of wet-bulb tem- 
perature for which the effective temperature was unaltered resulted in 
a decrease of stress. They argued therefore that the influence of the 
dry-bulb temperature was over emphasized at the expense of the 
wet-bulb temperature. There was also insufficient correction for the 
deleterious effects of low air movement (< 100 ft/min.) in hot humid 
conditions, especially when the subjects were clothed and the wet 
bulb temperature was high. Further in hot dry conditions when the 
air movement is increased from 20 to 200 ft/min. the effective tempera- 
ture scale indicated progressive improvement with increasing air 
movement instead of deterioration. 

On account of these defects in effective temperature McArdle and 
his colleagues were led to construct an alternative index of stress based 
on sweating rates. Using the results of nearly 1000 individual experi- 
ments they constructed an empirical nomogram from which the “pre- 
dicted 4-hour sweating rate’ —P4SR—for any set of working conditions 
could be ascertained, provided the environmental factors, the metabolic 
cost of the work and the clothing worn were known. The nomogram 
gives the P4SR for given wet-bulb temperature (°F.), dry-bulb tem- 
perature (°F.), air velocity (ft/min.) and metabolic rate (kcal./m/hr.). 
Predicted 4-hour sweat loss would have been a better term, for it 
is the amount of sweat lost in 4 hours which is maa from the 
nomogram. 


This was the position when the Tropical Research Unit at Mine 
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started work in January 1949. Its main task was to test the conclusions 
of the workers at the National Hospital, London, with young men as 
subjects who were naturally acclimatised to living in the Tropics. 
Among the objectives at- Singapore were: 


‘(1) To investigate in the Tropics the ability of men to withstand 
varying combinations of 
(a) differing dry and wet bulb temperatures and air movements 
(and later varying mean radiant temperature—air tempera- 
ture gradients), 
(b) different clothing systems, 
(c) work involving various metabolic costs, 
(d) varying periods and intervals of exposures to these con- 
ditions. 
(2) To estimate the predictive accuracy of the “P4SR”’ scale. 
(3) To assess the value of the Effective Temperature scale for 
grading the severity of thermal conditions in relation to human 
activities in the Tropics. 


Physiologists have attached great importance to obtaining a com- 
prehensive index of stress. Effective temperature, for instance, is 
widely used by ventilation engineers. P4SR is another such index. 

Really, of course, this is a double discriminant function problem. 
On the one hand, we have certain environmental variables: wet bulb 
temperature, dry bulb temperature, air velocity, work rate, type of 
clothing. These might be called the stress variates. On the other, we 
have variables which measure the physiological responses such as 
sweat rate, temperature, pulse rate. These might be called the strain 
variables, to use a distinction with an obvious physical analogy, sug- 
gested by my colleague, Dr. Macpherson. The problem is to find the 
best functions of each set of variables for making a prediction of strain 
from stress. I doubt whether canonical correlation technique could be 
used, but to state the problem in this form makes the aim of the work 
clear. 

One needs to emphasise that in this sense P4SR is an index of stress, 
not of strain. Once the nomogram had been constructed, P4SR was 
effectively a function of the climatic variables and work rate, and its 
object. was to predict strain. 


3. AN EXPERIMENT 


I propose to describe the first experiment which was carried out at 
Singapore. The object was to determine the effects on men naturally 
acclimatised to the Tropics of exposure for four hours twice weekly 
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to varying combinations of air temperature, humidity and air movement. 
Though not an ideal experiment, it was, I think, a great deal better than 
anything which had been done before. In particular, it shows how one 
may have to adapt statistical analysis to a situation which is by no 
means envisaged in a text book. 

Table 1 shows the combinations of air velocity, dry bulb temperature 


TABLE 1 
ENVIRONMENTAL CONDITIONS OF THE EXPERIMENTS 
Wet bulb temperature (°F.) 

Dry bulb 
temp. (°F.) 

80 83 85 88 

1 1 

90 2 2 

3 3 3 3 

4 — 4 

100 3 3 3 3 

1 1 

120 2 2 

3 3 3 3 

4 4 


Numbers 1-4 denote air movements as follows: 
= 44 ft/min. 
86 ft/min. 
300 ft/min. 
500 ft/min. 


1 
2 
3 
4 
and wet bulb temperature actually used. They were designed to cover 
the same range as had been investigated in London. Originally a 
4 X 3 X 2 factorial arrangement had been suggested, but this was 
modified for technical reasons. If 83° and 88°F. wet bulb temperatures 
had been tested instead of 80° and 85° at all air velocities the design 
would have been easier to analyse, but it was felt that this advantage 
was outweighed by the possibility of men collapsing while working 
at 88° F. wet bulb temperature, and such collapses, in themselves 
undesirable, would have complicated the statistical treatment of the 
results in another way. 


There were 3 teams with 4 subjects in each—young naval ratings 
who volunteered from ships or shore establishments on the Far East 
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Station. Each team had two 4-hour periods a week in the hot room, 
Four work-clothing combinations were tested at each exposure: working 
in shorts; working in overalls; resting in shorts; resting in overalls. 
Work consisted in step-climbing according to a certain routine. These 
four categories have been called ‘‘Postures’” for convenience. They 
may be allocated to 4 subjects in 24 different ways, and one of these 
was assigned randomly to each of the 24 climate combinations, separate 
randomisations being used for each team (Table 2). 


TABLE 2 
CoMBINATIONS OF CLIMATE AND POSTURE 
Climate combinations randomised Persons randomised 
(Separately for each team) 1 2 3 4 
I a b c d 
II b a d c 
III c d a b 
IV d c b a 
V a c b d 
VI b d a c 
VII d b c a 
VIII ¢c a d b 
Ix a d b c 
x d a c b 
XI b c a d 
XII c b d a 
XIII a b d c 
XIV b a c d 
XV c d b a 
XVI d c a b 
XVII a c d b 
XVIII b d c a 
XIX d b a c 
XX c a b d 
XXI a d c b 
XXII d a b c 
XXIII b c d a 
XXIV c b a d 
Posture: a: Shorts working c: Overalls working 


b: Shorts resting d: Overalls resting 
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In this plan all separate climate and posture comparisons were 
unconfounded with differences between persons and were therefore 
equivalent to comparisons on the same persons. The error term for 
these was originally intended to be based on such climate-posture 
‘interactions as were not confounded with personal differences. In 
testing the climate-posture interactions, it was proposed originally to 
ignore differences between persons within teams. It was not realised 
that these interactions were as important as they proved to be. Further, 
by what was subsequently recognised to be an error of judgment, the 
12 subjects were, on the basis of a uniformity trial carried out before the 
main trial started, divided into four grades of sweating with three 
subjects in each and one member of each grade was put in each team. 

Thus, there were two difficulties to be faced in the analysis. The 
arrangement of the climatic variables was not factorial, and it was not 
possible to allow for personal differences by the analysis of variance 
itself. The only way to correct for differences between persons was by 
analysis of covariance on the basis of the uniformity trial. To meet the 
other difficulty, the results of the trial were divided into two distinct 
sections (referred to in this report as Sections A and B) the first con- 
taining all combinations of 90° and 120° F. dry bulb temperature, 
80° and 85° F. wet bulb temperature and the four air velocities, and 
the second containing all combinations of 90, 100, 120° F. dry bulb 
temperature, and 80, 83, 85, 88° F. wet bulb temperature at the third 
air velocity (300 ft/min.). The analysis of covariance was carried out 
separately for the two sections. The combinations of 90° and 120° F. 
dry bulb temperature with 80° and 85° F. wet bulb temperature at an 
air velocity of 300 ft/min. occurred in each. . 

The statistical analysis was carried out for a number of response 
variables:— total sweat loss, total sweat loss per square metre of body 


surface, evaporative water loss (absolute and per square metre), final 


rectal temperatures, final pulse rates (seated and standing), comfort 
ratings and efficiency ratings. : 

Whenever an effect was found to be significant in the analysis of 
covariance, tables of the relevant means and their standard errors were 
compiled with a short paragraph headed ‘“‘remarks” briefly describing 
each effect. In this way the salient features of the results were treated 
systematically for the first time, I believe, in this particular field. I 
do not propose to trouble you with the details of the analysis of co- 
variance, but I give examples in Tables 3 and 4 of the manner of pre- 
senting the results. These particular tables refer to ‘“Total Sweat Loss”. 

The important conclusions from Tables 3 and 4 are:— 

(1) The direct posture effects are not completely informative if we do 
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TABLE 3 (SECTION A) 
Tora, Sweat Loss: AVERAGE PosTuRE AND AVERAGE CLIMATE EFFECTS 
(Mean values in grammes. No correction necessary) 


(i) Posture Means 


(a) Shorts working 2582 (c) Overalls working 2721 
(b) Shorts resting 1475 (d) Overalls resting 1409 
Shorts—Overalls —37 
Working—Resting 1210 
Interaction —103 
Standard error 38 (for each of the above means) 


(it) Climate Means 


Air Wet bulb | Dry bulb 
velocity temperature (°F.) Mean | temperature (°F.) Mean 
(ft./min.) 
80 85 | 90 120 
44 1964 2444 2204 | «1255 3152 2204 
(480) (1897) 
86 1768 1992 1880 1053 2707 1880 
(224) (1654) 
300 2045 1918 1982 947 3016 1982 
(—127) (2069) 
500 2041 2204 2122 805 3440 2122 
(+163) (2635) 
Mean 1955 2140 | 2047 1015 3079 | 2047 


Standard error of single means 53 


Remarks: The direct posture effects are not completely informative if we do not consider the 
interactions with climate as well. The excess sweat rate of working over resting subjects is the only 
important significant average effect; it is a little greater in overalls than in shorts. 

The general trend of sweat rate with increasing air velocity is a fall followed by a rise; the difference 
between the trends at wet bulb temperatures of 80 and 85°F. also show a fall followed wad a rise; the 
same is true of dry bulb temperatures of 90 and 120°F. 


not consider the interactions with climate as well. The excess sweat 
rate of working over resting subjects is the only important average 
effect; it is a little greater in overalls than in shorts. 

(2) The general trend of sweat rate with increasing air velocity is a 
fall followed by a rise; the difference between the trends when the wet 
bulb temperature was 85° and 80° F. also shows a fall followed by a rise; 
the same is true when the dry bulb temperature was 90° and 120°. 
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TABLE 3 (SECTION B) 
ToraLt Sweat Loss: AVERAGE PosTURE AND AVERAGE CLIMATE EFFECTS 
(Mean values in grammes. No correction necessary) 


(i) Posture Means 
Air velocity 300 ft/min. 


(a) Shorts working 2350 (c) Overalls working 2612 
(b) Shorts resting 1477 (d) Overalls resting 1387 
Shorts—Overalls —86 
Working—Resting 1049 
Interaction —177 


Standard error, 38 (for each of above means) 


(it) Climate Means 
Air velocity 300 ft/min. 


Wet. bulb Dry bulb temperature (°F.) Mean 
temperature (°F.) 90 100 120 

80 990 1606 3101 1899 

83 969 1577 3015 1854 

85 904 1734 2932 1856 

88 1115 1867 3666 2216 

Mean 994 1696 3179 1956 


Standard error of single means, 66. 


Remarks:The following effects may be noted when the air velocity is 300 ft/min. The direct 
posture effects are not completely informative if we do not consider the interactions with climate as 
well. The excess sweat rate of working over resting subjects is the only important significant average 
effect. It is a little greater in overalls than in shorts. ; 

The average sweat rate increases greatly and steadily as the dry bulb temperature increases from 
90 to 120°F. When the wet bulb temperature increases from 80 to 88°F. there is a small fall followed 
by a rise which is somewhat greater at the higher dry bulb temperatures. 


On the average, the sweat rate is a little greater with the wet bulb 
temperature at 85° F. than at 80° F. and very much greater when the 
dry bulb temperature is at 120° F. than at 90° F. 

When the air velocity was 300 ft/min. the sweat rate increased 
greatly and steadily as the dry bulb temperature increased from 90° 
to 120° F. When the wet bulb temperature increased from 80° to 
88° F. there was a small fall followed by a rise which is somewhat 
greater at the higher dry bulb temperatures. 

(3) The average sweat rate in shorts is greater than in overalls when 
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Torat Sweat Loss: INTERACTIONS OF PosTURE AND CLIMATE 
(Mean values in grammes, corrected for personal differences in uniformity trial) 
(i) Air Velocity: Dry bulb Temperature: Posture 


Dry bulb temperature (°F.) 


Air 
velocity 90 120 
(ft/min.) (a) (b) (c) (d) |Sh—O}| (a) (b) (c) (d) |Sh-—O 
44 1791 494 2045 691 |—226 || 3746 2177 4155 2531 | —382 
86 1499 459 1705 550 |—148 || 3234 2120 3593 1881 | — 60 
_ 300 1305 459 1493 531 |—130 || 3601 2698 3531 2236 | +266 
500 1198 303 1372 347 |—109 || 4283 3095 3876 2504 | +499 
All standard errors for single means are between 106 and 110 
Air Difference in loss (120-90°F. dry bulb) 
velocity 
(ft/min.) (a) (b) (c) (d) Sh-—O W-R Interaction 
44 1955 1683 2110 1840 | *-156 271 + 1 
86 1735 1661 1888 1331 + 89 316 —242 
300 2296 2239 2038 1705 +396 195 —138 
500 3084 2793 2503 2158 +608 318 — 27 
Mean 2268 2094 2135 1759 +234 275 —102 
All standard errors for single means are between 150 and 160 
(it) Air Velocity: Wet Bulb Temperature: Posture 
Air Difference in loss (85-80°F. wet bulb) 
velocity 
(ft/min.) (a) (b) (c) (d) Sh-—O W-R Interaction 
44 849 179 339 555 67 227 443 
86 29 —65 671 261 —484 252 —158 
300 -—213 -61 -—215 21 173 19 
500 86 82 448 34 —157 209 —205 
Mean 236 —4 349 159 —138 215 12 


All standard errors for single means are between 150 and 160 


(a) = Shorts working 
(b) = Shorts resting 


Sh = Shorts 
O = Overalls 


(c) Overalls working 

(d) = Overalls resting 
W = Working 
R = Resting 
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Remarks: (i) At air velocities of 300 and 500 ft/min. the average sweat rate in shorts is greater 
than in overalls, when the dry bulb temperature is 120°F.; but when it is 90°F. the reverse is true. 
The “cross-over’’ is at a point below 100°F. for resting subjects but higher for working subjects. At 
the two lower air movements, the sweat rate is higher in overalls than in shorts, at both 120 and 90°F, 

(ii) The especially large values —484, +443 need discussion. With an air velocity of 86 ft./min. 
the sweat rate is on the average lower in overalls than in shorts when the wet bulb temperature is 
80°F. but higher in overalls than in shorts when it is 85°F. This is mainly due to the anomalous 
values for shorts for the 85°F. wet bulb/120°F. dry bulb temperature combination. 

When the air velocity is 44 ft./min. and the dry bulb temperature 120°F., in working subjects 
the sweat rate with overalls is appreciably above that with shorts when the wet bulb temperature is 
80°F. and about the same at 85°F.; in resting subjects the reverse is the case. This effect does not 
occur at this air movement when the air temperature is only 90°F. 


TABLE 4 (SECTION B) 
Tora Sweat Loss: InTERACTIONS OF PosTURE AND CLIMATE 
(Mean values in grammes, corrected for personal differences in uniformity trial) 
(i) Posture: Dry Bulb Temperature (Air Velocity 300 ft./min.) 


Dry bulb Posture 
temp. (°F.) (a) (b) (c) (d) Sh-—O W-R Interaction 
90 1416 444 1536 582 -—129 963 9 
100 2055 1213 2378 1138 —124 1041 —199 
120 3578 2775 3921 2440 —4 1142 —329 


All standard errors for single means are between 65 and 73 
(it) Posture: Wet Bulb Temperature (Air Velocity 300 ft./min.) 


Wet bulb Posture 

temp. (°F.) (a) (b) (c) (d) Sh-—O W-R Interaction 
80 2329 1490 2394 1384 21 925 — 8 
83 2190 1461 2430 1335 — 57 912 —183 
85 2335 1420 2421 1251 42 1043 —128 
88 2545 1539 3203 1579 —349 1315 —309 


All standard errors for single means are between 75 and 84 


Remarks: (i) The following effects may be noted when the average air velocity is 300 ft./min. 
The sweat rate is on the average greater in overalls than in shorts when the dry bulb temperature is 
90 and 100°F. but not when it is 120°F. The sweat rate in working subjects is greater in overalls 
than in shorts and increasingly so with a rise in dry bulb temperature (At 120°F., this result is different 
from Section A; this is due to the inclusion of the results for 88°F. wet bulb temperature in the average.) 
in resting subjects—it is less in overalls than in shorts except at the dry bulb temperature of 90°F. 

(ii) The sweat rate in shorts is on the average about the same as in overalls except at the wet 
bulb temperature of 88°F.; but at 88°F. the average sweat rate in overalls is considerably higher than 
in shorts. For working subjects the average sweat rate is higher in overalls than in shorts at all four 
wet bulb temperatures; in resting subjects it is lower in overalls than in shorts, except at 88°F. where 
it is about the same. 
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the air velocity is 300 and 500 ft/min. and the dry bulb temperature 
is at 120° F.,* but at 90° F. the reverse is true. The “cross over’ point 
is below the dry bulb temperature of 100° F. for resting subjects but 
higher for working subjects, for whom it depends partly on the level of 
wet bulb temperature. At the two lower air movements the sweat 
rate is higher in overalls than in shorts at dry bulb temperatures of 
both 120° and 90° F. 

When the average air velocity is 300 ft/min. the average sweat 
rate in working subjects is greater in overalls than in shorts for all four 
wet bulb temperatures and increasingly so with a rise in dry bulb 
temperature; in resting subjects it is less in overalls than in shorts 
except for the dry bulb temperature of 90° F. and 88° F. wet bulb 
temperature, when it is about the same. 


4. RELATION OF THE EXPERIMENTAL RESULTS TO P4SR AND 
EFFECTIVE TEMPERATURE 


The following method was used to compare the “total sweat rates” 
obtained from this series of experiments with the P4SR values obtained 
from the nomogram constructed by McArdle and his colleagues. 

In Section A there were available 64 climate-posture combinations; 
in Section B, 48. The regressions of y on x, were calculated for each 
series separately 


where y = corrected mean total sweat 
xz, = P4SR value from the nomogram. 


An analysis of variance was then performed on the deviations from 
regression, so that it was possible to see for which treatment combinations 
departures from the expected values were significant and to examine 
all such effects. 

The two regression lines proved to be: 


Section A Y = 2047 + 0.9783(2, — 1968) 
Section B Y = 1957 + 0.9248(2, — 1890) 


These two lines agree well; they do not differ significantly in position 
or slope. They both yield positive values for ‘total sweat’? when the 
P4SR is zero; these are 122 + 46 and 209 + 39. Both values are small. 
Had the lines been assumed to go through z, = 0, Y = 0 the slopes 
would have been 1.023 and 1.006. Thus, on the average, the total 
sweat rates in this experiment agree closely with those predicted from 


*This is true at 500 ft./min. for working and resting subjects separately. It is true for working 
subjects at 300 ft./min. if only the results for 80°F. wet bulb and 85°F. wet bulb are included in the 
average (though the difference is small); but not if all four levels of wet bulb are included. It is true 
for resting subjects at 300 ft./ min. 
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the nomogram for the same working conditions. The overall correlation 
coefficient between them is 0.96. 

Nevertheless, there are significant differences between the devia- 
tions from regression of particular treatment combinations, and these 
have all been examined carefully. 

The most important of these departures from expectation can be 
expressed in the following way. 

In London, men wearing overalls had a higher sweat loss than men 
wearing shorts over the whole range of climatic conditions. Con- 
sequently, the values predicted from the P4SR nomogram had this 
same property. In Singapore, however, while this was true at a dry 
bulb temperature of 90° F., at 120° F. the total sweat loss was in general 
greater for men wearing shorts at all the air velocities except the lowest. 
A similar method was used to compare the “total sweat rates” obtained 
from this series of experiments with their expected values calculated 
from the regression of y on “effective temperature.” 

The two regression lines are: 


Section A Y = 2047 + 208.4(z, — 86.50) 
Section B Y = 1957 + 209.6(z, — 86.05) 
with y = corrected mean total sweat 

2, = “effective temperature” 


The results for Section A and B agree closely, but whereas the overall 
correlation of total sweat with P4SR is 0.96, with effective temperature 
it is only 0.78. Thus the deviations of total sweat rates from the values 
expected on the basis of effective temperature are very considerably 
greater than on the basis of P4SR. These deviations were also examined 
carefully. It was apparent, however, that the main cause was the 
difference in position of the regression lines for working and resting 
subjects considered separately (the slopes were not significantly differ- 
ent). This might have been anticipated because the effective tempera- 
ture scale was originally constructed for resting subjects, and would not 
necessarily apply in the same way to working subjects. In fact, in 
terms of physiological reactions or of comfort an effective temperature 
of, say, 85° does not mean the same thing for working as for resting 
subjects. In other words, if separate regressions for working and resting 
subjects are calculated for P4SR the two regressions form one line, for 
effective temperature there is a difference in position but not in slope. — 
Table 5 makes the matter quite clear. It gives the average correlations 
within the four work-clothing groups and for all four groups combined. 
Within groups effective temperature is almost as good a predictor as 
P4SR and a rather better one for rectal temperature and pulse rate. 
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TABLE 5 
CoEFFICIENTS OF AVERAGE CORRELATION, WiTHIN Work CLOTHING GROUPS AND 
For ALL Four Groups CoMBINED, OF ENVIRONMENTAL INpDicES P4SR anp 
“EFFECTIVE TEMPERATURE” WITH PHYSIOLOGICAL OBSERVABLES 


Physiological P4SR Effective temperature* 
observable Within Groups Within Groups 
groups combined groups combined 


Total sweat 0.98 0.96 0.93 (0.91) 0.80 (0.78) 
End rectal temperature 0.84 0.74 0.90 (0.86) 0.48 (0.48) 
Pulse rate 0.86 0.54 0.90 (0.86) 0.63 (0.63) 


*These values allow for non-linearity of the regression; the values in brackets are the Ist order 
correlation coefficients. 


We have summarised the comparison of P4SR and effective tem- 
perature as follows: 


“The Singapore observations on sweat loss and changes in body temperature 
and heart rate in general support the conclusion of the London workers that the 
effective temperature scales are not completely adequate to describe the general 
pattern of changes in these variables for varying wet bulb temperatures and air 
velocities, but they indicate that the effective temperature scales over-emphasize the 
contribution of the wet bulb temperature to thermal stress and not the dry bulb 
temperature as the London workers suggested. 

If each work-clothing combination is taken separately, the predictive accuracies 
for these “naturally” acclimatised naval ratings of the effective temperature scales 
and the “predicted four-hour sweat rate’? nomogram constructed by the London 
workers are about the same, though there is a slight advantage to the latter in 
predicting sweat loss. However, when the results of all groups of experiments are 
combined, correlations with effective temperature are considerably lower than with 
“predicted four-hour sweat rate”. This is due to the fact that the effective tempera- 
ture scales make no allowance for differences in work rates. They were not designed 
to do so, being intended originally only for comparisons when clothing and work 
conditions were held constant. In this sense “predicted four-hour sweat rate” is a 
more comprehensive index. It also gives a more adequate picture of the change in 
stress with air movement. In hot conditions effective temperature does not allow 


_ adequately for the reduction in stress as air velocity is increased to 100 ft/min. and 


indicates too great an improvement at 200-500 ft/min. On the other hand, the 
concept of predicted 4-hour sweat rate can only be applied within the range of 
climate-work-clothing combinations which cause people to sweat. It cannot replace 
effective temperature under the more comfortable and desirable conditions of light 
and sedentary work with which it was designed to deal primarily, for sweating will 
not occur under these conditions. In this sense it is less comprehensive than effective 
temperature, but it is a more accurate index of physiological effect under conditions 
of thermal stress.” 


Finally it is worth mentioning that the relation between the ob- 
served body-temperatures and pulse rates at the end of the experiments 
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and the four-hour sweat loss predicted from the nomogram has been 
used tentatively to define the upper tolerable levels of warmth (those 
levels above which an increasing number of men will fail to complete 
their work) for working men in the same state of acclimatisation as the 
Singapore subjects and engaged in activities involving a similar rate 
of energy expenditure. This indicates that these levels are reached 
when climate/work/clothing combinations correspond to a four-hour 
sweat loss of 3.5 litres predicted from the nomogram of McArdle et al 
(1947). 


5. CONCLUSION 


I have tried to illustrate by an example of work done at Singapore 
a few aspects of a very wide subject. 

The experiment described was one on men naturally acclimatised 
to the tropics and made possible a comparison of their performance 
with that of men artificially acclimatised in London. Subsequent 
experiments of similar type at Singapore tested the performance of 
men artifically acclimatised with a similar routine to that used in London 
and of ship-acclimatised seamen who were brought to the laboratory 
for a single exposure to work at high temperatures. 

There was not a great deal of difference in the levels of response to 
work at high temperatures of the artificially acclimatised men in London, 
artificially acclimatised men in Singapore, naturally acclimatised men 
exposed twice a week in Singapore and ship acclimatised men in Singa- 
pore, although the latter two groups were, if anything rather less tolerant 
than the highly trained laboratory subjects in the first two groups. On 
the other hand, Dr. J. S. Weiner has repeated the Singapore experiment 
on ship-acclimatised men with completely untrained subjects at Oxford 
to show that completely unacclimatised men in England are very 
much less tolerant than any of these groups. It seems fairly certain 
that for short exposures to work at high temperatures, conclusions 
derived from the study of heat acclimatised subjects in London are 
broadly applicable to heat-acclimatised men working at high tempera- 
tures in the tropics. 

Later experiments at Singapore examined the effects of radiant heat 
very carefully. There were trials with living subjects; also some interest- 


ing cross-checks on the calculation of mean radiant temperature were — 


obtained by the use of artificially constructed metal men. 

Further, an extensive series of trials was carried out by Mr. R. D. 
Peplar, the psychologist to the unit, on the effects of high temperatures 
on mental performance. However, these do not fall within the limits 
of the present discussion. 
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The Tropical Research Unit at Singapore was directed by Surgeon 
Commander F. P. Ellis. To him and to Dr. R. K. Macpherson, the 
physiologist to the unit, I am much indebted for the interest they have 
given me in this work; but much more important is the indebtedness 
to them of the subject of Climatic Physiology. 
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CONFIDENCE LIMITS FOR MEASURING THE PRECISION 
OF BIOASSAYS* 


C. I. Buiss 


The Connecticut Agricultural Experiment Station and Yale University, New Haven, 
Connecticut, U.S.A. 


An experiment for determining how much of one preparation, the 
“Unknown”, is needed to produce the same reaction in living material 
as a stated amount of a second preparation, the “Standard’’, is known 
as a bioassay. Bioassays may be divided experimentally into two 
types, (1) those where the dependent variable is a threshold dose 
measured directly in each test animal, and (2) those where the dependent 
variable is the size of the reaction to dosages fixed by the experimenter. 

In the few assays of the first type, each threshold dose can be trans- 
formed to its logarithm and the log-relative potency (M’) computed 
either as the difference between two mean log-doses, or as a mean differ- 
ence. Both have confidence intervals that are well-known and simple. 
In the many assays of the second type, the difference in the response to 
the two preparations must be converted to units of dose. If the response 
plots linearly against the log-dose, the difference between the two mean 
responses is divided by the common slope of the log-dose response 
curves for the Standard and for the Unknown to obtain M’. If the 
response gives a straight line with arithmetic dosage units, potency is 
computed instead from the ratio of the slope for the Standard and that 
for the Unknown. In either case, the log-potency or potency depends 
upon the ratio of two statistics. The confidence limits for a ratio are 
more complex than those for a difference. 

In the form proposed by Marks (Fieller [1944]) for balanced cross- 
over assays, and applied later by Gridgeman [1951] to other factorial 
assays, the confidence or fiducial limits of a ratio are not difficult to 
compute. With little loss in simplicity, Marks’ equation can be ex- 
tended to all assays based upon the ratio of two statistics, and accord- 
ingly it has been adopted in U.S.P. XV [1955]. The purpose of this 
paper is to review the general equation and consider some of its exten- 
sions in sufficient detail to facilitate their understanding and use by 
the practising bioassayer who is not a professional statistician or 
biometrician. Since the calculation of each U.S.P. assay, including its 
confidence interval has been illustrated elsewhere (Bliss [1956a]), the 
numerical examples are restricted here primarily to non-official assays. 


*Published with the aid of an educational grant from the U. S. Pharmacopoeial 
Convention. 
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CONFIDENCE LIMITS FOR ASSAYS BASED UPON A DIFFERENCE 


In the assay for digitalis, the threshold dose is determined directly. 
A test solution is injected slowly into the alar vein of a pigeon until its 


_ heart stops, the amount injected being the threshold dose. This is 


determined in six or more pigeons with the Standard (S) and again 
with the Unknown (U). When thus measured directly, the log-threshold 
dose follows a normal distribution (Bliss [1944]). If each dose is trans- 
formed to its logarithm z, the log-relative potency M’ of the Unknown 
is the difference between the two mean log-doses or 


M’ = £5 — (1) 


The variance of a single z is computed from the Ns and Ny log-doses 
for each preparation as 


(xs)/Ns — ting (tv)/Nu}/n (2) 


with n = Ns + Ny — 2 degrees of freedom. The confidence limits of 
M'’ are given by the familiar equation 


Xu =M + + = M + (3) 


where ¢ is the tabular value of Student’s ¢ at P = 0.05 for n degrees of 
freedom and L is the length of the log-confidence interval. 

For the threshold dose of digitalis in pigeons the standard deviation 
is so small relative to its mean, that the log transformation can be 
omitted and essentially the same potency obtained directly from the 
ratio of the two mean threshold doses. This is the calculation given 
in U.S.P. XV. Although potency is computed more easily as a ratio, 
its confidence interval is more complex, as will be shown later. The 
anti-logarithms of the simpler limits in Equation (3) agree closely with 
those for the ratio of two mean threshold doses (Bliss [1956a]). 

A second drug assayed from the threshold dose is tubocurarine. 
A test solution is injected slowly into an ear vein of a rabbit until its 
head drops, the amount of tubocurarine injected being the threshold 
dose. The rabbit is revived and tested again the following day. Half 
of the rabbits, f in number, are injected first with the Standard and 
then with the Unknown, and the remaining half in the reverse order. 
The metameter z in each rabbit is the difference of the log-threshold 
dose of the Standard minus the log-threshold dose of the Unknown, 
so that here the log-relative potency of the Unknown is the mean 
difference for the 2f rabbits, or M’ = > x/2f = @. The error variance 
of a single z is computed from the variation within groups as 


v= 


| 

j 
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where T, = )> z for the rabbits injected first with the Standard, 
T, = >. z for those injected in the reverse order, and n = 2f — 2. 
The length of the log-confidence interval for a mean difference based 
upon 2f values of z is the standard error of M’, sy, , multiplied by twice 
the tabular value of Student’s ¢ at P = .05 for n degrees of freedom, 
giving 

L = 2st/V2f = (5) 
with limits Xu = jf’ 


CONFIDENCE LIMITS FOR A RATIO 


In most bioassays, relative potency is computed from the ratio of 
two statistics. The sample preparation or Unknown (U) is assigned 
a provisional potency in units of the Standard (S). Both preparations — 
are then tested at similar dosage levels in terms of their assumed and 
known potencies. The objective is to adjust this provisional potency. 
In a parallel-line assay, the response to both preparations is expressed 
in units y, which, over an adequate working range, can be plotted 
against their respective log-doses x as parallel straight lines with a 
common slope b. The difference between the two mean responses, 
Jv — Js = 4, is divided by the slope 6 to convert a to units of log-dose. 
The log-relative potency then has the simple form 


M’ = (Gu — Gs)/b = a/b (6) 


Both a and 0 are statistics or estimates subject to sampling errors. As 
the ratio of two statistics, M’ is also a statistic. It is an estimate of 
the true relative potency of the Unknown, which we will call u, and since 
it is only an estimate, we need a measure of its precision. 


The basic equation. 


The precision of M’ is measured in terms of the interval L between 
an upper and a lower confidence limit. These limits are so spaced 
above and below M’ that, on the average, they will include the true 
log-relative potency yu in a selected proportion of assays, commonly in 
19 out of 20 (1 — P = 0.95). Since the responses y are normally dis- 
tributed, the statistics a and b from which M’ is computed are also 
normal, with variances v,, and v,, and covariance v,, . Moreover, 
a and b are unbiassed estimates of the average difference in response 
and of the average slope, respectively. It follows that the difference 
a — ub, in unlimited repeated assays, is also normally distributed with 
a mean of zero. The variance of the difference a — ub is then 
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These definitions lead readily to the confidence limits for » as the 
roots of a quadratic equation (Fieller [1944]). Since the variances of 
a and b in any given assay are based necessarily upon limited informa- 
tion, the distribution of the ratio (a — ub)/V V is that of Student’s ¢. 
Confidence limits for yw are obtained by solving the inequality 
—t < (a — ub)/VV < ¢ for u, or the equivalent equation 


(a — pb)? — ?V =0 
Usually, ¢ is the tabular value for P = 0.05 of Student’s ¢, with appro- 
priate degrees of freedom, where P is the proportion of assays in which 
the confidence interval does not include uw. Substituting for V its 
definition, we have the quadratic equation 
— — Qu(ab — v,,t?) + (a’ — = 0 
Solved for / - roots of this equation are the required confidence 
limits, x M’ » 
This equation has appeared in a variety of forms (Bliss [1945]; Finney 
[1952b]). For conversion to that proposed by Marks (Fieller [1944]) let 
C = b°/(b* — vt’) (7) 


from which C — 1 = v,°C/b’.* Substituting C and M’, the above 
equation for Xy- reduces algebraically to 


Xue = CM’ — K + V(C — 1)(CM” + 0,./%3) + K(K — 2CM’) (8) 


or 


Xue = CM’ — K + 3L 


where K = (C — 1)v,,/v,, and L is the length of the confidence interval. 
When the numerator and denominator for M’ = a/b are independent, 
so that v,, = 0, all expressions involving K vanish and Equation (8) 


simplifies to 


Xue = CM’ + V(C — 1)(CM” + = CM’ + 4L (9) 


Although a, 6 and M’ here have specific meanings, these equations can 
be applied to the ratio of two statistics with other definitions for a and b. 

The larger the inequality b’ > »,,/’, the more nearly C approaches 
unity and the shorter is the interval L. L is a convenient index to the 
inherent precision of an assay but not always to its accuracy, since the 


*To avoid confusion with other terminologies, note that C in Equation (7) is equivalent to C* 
in publicati by the p t author (1945, 52] and to 1/(1 —y) in those by Finney [1952a, b]. 
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potencies computed from independent replicate assays may differ more 
than would be predicted from their individual confidence intervals. 
When such a discrepancy occurs, it may be due to chemical instability, 
lack of uniformity in the drug, differences in moisture content, or some 
other variable that changes between but not within assays. Fortunately, 
most assays are either free of this complication or not sufficiently precise 
for it to be detected. 

The log-relative potency M’ from Equation (6) is easily converted to 
the log-potency M by the equation 


where Zs and Zy are the mean log-doses of the two preparations. Their 
difference in a balanced assay is equal to log R, where R is the ratio of 
a given dose of the Standard to the corresponding dose of the Unknown. 
Es , £, and R are assumed to be controlled measurements with negligible 
sampling errors. To obtain the confidence limits of M from those of 


M’, we need only add Zs; — Zy , or log R, to the upper and lower limits 
of M’, 


General confidence limits without covariance 


Wherever possible, parallel-line assays are designed so that a and b 
in Equation (6) are independent and the confidence interval can be 
based upon Equation (9). Several Unknowns, A in number, are some- 
times assayed concurrently against the same Standard. The entire 
assay may then be computed as a unit, on the assumption that the 
h + 1 preparations act similarly, in terms of the response y. If true, 
their log-dose response curves will have the same slope b within the 
experimental error. When the assay slope can be computed from the 
combined evidence of several preparations, its variance and C are 
reduced correspondingly. 


To compute the assay slope, the following terms are determined 
from the k log-doses x for each preparation: 


and 
[xy] = (xT) — (fz) T./N. 


where T, = >> y at each 2, f is the frequency or number of y’s in each 
treatment total T, , and N; = )> f for a given preparation. These 
sums of squares and products are totalled over the h + 1 preparations 
to obtain the assay slope . 


ES 
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with which each M’ is computed by Equation (6). The variation in 
y accounted for by b is 


If the responding units have been assigned to each treatment 
entirely at random and there is ample replication, the error variance 
of the assay may be computed from the variation of the N values of y 
within treatments as 


(13) 


where the degrees of freedom n = N — >> k. The variance of the slope 
bis 


and from Equation (12), b? = B®>-[z’]. Substituting these terms in 
Equation (7), we have 


Vo 


C = B/(B’ — (14) 


where ¢ is the tabular value of Student’s ¢, usually at P = 0.05, for the 
degrees of freedom n in s’. 

One other term is needed to solve Equation (9), namely the ratio 
Vao/Ys, - The variance of the ‘difference 7y — Js = a is computed 
with the error variance of the assay (s”) as 


From the variance of the slope v,, , defined above, the ratio 


= + i} Lie’) (15) 


Note that this ratio depends only upon the design of the assay and not 
upon its outcome. 


THE CONFIDENCE INTERVAL FOR SELECTED PARALLEL-LINE 
ASSAYS WITH A GRADED RESPONSE 


Since they depend only upon the design, the constants in a con- 
fidence interval can be determined in advance, a considerable advantage 
if the same design is to be used repeatedly. Those discussed below are 
factorial assays, involving the two factors of preparation and dosage 
level in balanced arrangements, with the same number of responses for 
each “treatment” or dosage level of a preparation. Factorial designs 
facilitate both the calculation of potency and tests of the underlying 
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assumptions of parallelism and linearity. In the absence of more reli- 
able estimates, the error variance may be based upon discrepancies 
from these assumptions. 


Balanced factorial assays with one Unknown 


The simplest factorial assay involves two preparations, the Standard 
and one Unknown, each at k dosage levels, which, in turn, are spaced 
at equal intervals z on a logarithmic scale. The test units, which may 
be hypophysectomized rats, blood pressure readings, tubes inoculated 
with micro-organisms, or some other biological indicator, are assigned 
at random, but in equal numbers, to each dose of each preparation. 
If a value is lost during an assay, it is replaced at the start of the calcu- 
lation, as described later. The f responses (y) for each treatment are 
totalled to obtain the T, ; these 7, lead directly to the log-potency 
and its confidence interval. 

The 7; are multiplied by the factorial coefficients x, and x, in 
Table 1 and the products summed to obtain JT, = pa (z.7'.) and T, = 
>> (x,T,). The sums of the squares of these coefficients are e, = 
> 2; = 2k and e, = >> 23 = 2 >> (zi), where the z,’s are the coded 
log-doses for a single preparation. The difference between the mean 
response for the Unknown and that for the Standard is Jy — js = 
2T./fe. = T./fk. The combined slope of their log-dose response 
curves is b = 7',/fe,i’ = 67,i’/fk(k’ — 1)’, where 7’ is the log-dosage 
interval corresponding to an interval of 1 in the coded units z,. When 
the number of dosage levels k is odd, successive values of x, differ by 1 
and z’ = 7; when k is even, successive values of x, differ by 2 and 7’ = 7/2. 
Substituting in Equation (6), the log-relative potency of a balanced 
factorial assay is 


M' = ciT,/T, (16) 

where c = 2¢,1’/e,i = (k’ — 1)i/6i’, as given at the end of Table 1. 

Since )> 2,2, = 0, T, and T, in Equation (16) are independent, 

and the confidence interval for M’ is based upon Equation 9. In 

calculating C, B’ in Equation (12) reduces to B’ = T?/e,f, or C may 
be computed directly as , 


C = Ti/(T, — (17) 


The ratio v,,/v,, in Equation (15) may be written as c’i’, where c’ = 

c’e,/e, or, as Gridgeman [1951] has shown, c’ = (k’ — 1)/3; the values 
of c’ for k = 2 to 6 are given in Table 1. The length of the confidence 
interval then is 


L = 2V(C — 1)(CM” + (18) 
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TABLE 1 
FacrortaL Corrricienrs x ror ANALYZING A Ba Lancep Broassay, IN WHICH 
Successtve Loc-Doses or Sranparp (S;) AND OF SAMPLE OR “UNKNOWN” 
(U;) akE Spacep Equatiy, Eacu wit THE SAME NuMBER (f) oF RESPONSES 
ToraLuine 


Factorial coefficient x for each dose 
T; 
Design | Row 
S: Se Ss Ss Ss Se Ui U2 Us Us Us Us 
2,2 a |-1 -1 4|T. 
b -1 1 -1 1 
ab 1 -l -1 1 
3,3 e | -1 -1 11 1 6|T. 
b -1 1 -1 1 
ab 1 0-1 -1 0 1 4 | Tos 
q 1-2 1 1-2 1 1317, 
aq -1 2-1 1-2 1 | 
4,4 a -1 -1 -l 1 1 1 8:12, 
a |-3-1 1 38 —3-1 1 3 40 | 
Ps ab 3 1-1 -3 -3-1 1 3 40 | Tas 
| q 1 1-1-1 1 sift, 
ag |-1 1 1-1 1-1-1 1 
5,5 a -1 -1 -1 -l 1 10 17, 
b —2-1 0 41 2 —2-1 0 1 2 20 | Ts 
ab 2 1 0-1 -2 —2-1 0 1 2 20 | Tas 
q 2-1-2-1 2 2-1-2-1 2 238 | T, 
aq —2 1 2 1-2 2-1-2-1 2 28 | Tee 
6,6 a -1 -1 -1 -1 -1l 4 42 4 
b =—§ 13 5&8 -8-1 13 8 14017% 
ab 5 3 1-1 -3 -—5 -3 -1 1 
q 5 -1 -4 -4 -1 5 5 -1 -4 -1 5] 168|T, 
ag |-5 1 4 4 1-8} 5-1-4 168 | To, 
CoMPUTATIONAL CONSTANTS 
of For Equation | Constant Value in design 
4 computing 
: 2,2 3,3 4,4 5,5 6,6 
MW’ ase) |e 1 4/3 5 4 35/3 
L (18) | Pa 1 8/3 5 8 35/3 
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The factorial coefficients in rows ab, q and aq of Table 1 are for 
testing the validity of the assay. The treatment totals are multiplied 
in turn by the coefficients in each row and the products summed to 
obtain T; = >> (z;7,). The ratios in these three rows, T?/e,f, should 
not exceed the error variance s” significantly. That from row ab tests 
whether the log-dose response lines are parallel; those from rows q 
and aq test their linearity. If any ratio exceeds s’ as much as three- 
fold in an assay with three or more doses, the validity of the assay may 
be tested by computing the variance ratio, F = >> (T%/e,f)/3s*, for 
comparison with the tabular F for n, = 3 and n, = degrees of freedom 
in s’. 

The U.S.P. assay for Adrenal Cortex Injection is a two-dose factorial 
assay with the ratio of the two dosage levels fixed at 5:3 or 7 = 0.222 = 
2/9 approximately. Its log-relative potency can be computed as 
M’ = 2T,/9T, , and L determined with c’:? = 0.04922. An example 
of a three-dose U.S.P. assay is that for Corticotropin Injection, where 
the log-interval 7 is selected by the experimenter. 


Partially-balanced factorial assays 


A bioassay is based upon the central part of the log-dose response 
curve, where the response y is substantially a linear function of the 
log-dose x. If the assumed potency of an Unknown is too high or too 
low, the response to its smallest or largest dose may approach a lower 
or an upper limit, so that the Standard and Unknown no longer give 
parallel straight lines over all doses. Dropping an end dosage level 
of the Unknown may restore assay validity but sacrifice the initial 
balance in respect to k. 

Whenever the two preparations differ by one in the number of 
dosage levels and the log-dose interval 7 is constant, the assay can be 
computed and its validity tested with the factorial coefficients in Table 2, 
proposed by Wood [1953]. If the Unknown has the larger number of 
dosage levels, the coefficients for the Standard and the Unknown are 
interchanged and their signs in rows a, ab and ag reversed. The log- 
relative potency M’ and the confidence interval L are computed as 
before with Equations 16 and 18, but with c = e,/>,|2z,| andc’ = 
c’e,/e, , a8 at the end of the table. Because of the unequal number of 
dosage levels, M’ is converted to M with the first form of Equation (10). 

An example is provided by an assay of the vitamin C activity of 
fresh orange juice (Bliss [1952a]). The juice or Unknown was analyzed 
chemically for its ascorbic acid content, and on this basis fed to depleted 
guinea pigs at 0.5, 1.0 and 2.0 mg. of ascorbic acid per day. Pure 
ascorbic acid at the same dosage levels served as the Standard. When 
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TABLE 2 
FacroRtaL CoerricleNts POR ANALYZING A PartIALLY BALANcED Assay, 1N 
Wuicu Successtve Loc-Dosrs or STANDARD (S;) AND OF SAMPLE OR 
“Unknown” (U;,) are Spacep Equatiy, Each THE SAME 
NuMBER (f) or Responses T;, 
If the number of successive doses of the Sample exceeds by one the number for the 
Standard, interchange S; and U; in the heading and reverse all signs in rows a, 
ab, and ag. (Wood [1953]) 


Factorial coefficients x for each dose 
Design | Row e | T; 
S3 Ss Ss U; U2 U3 Us, Us 
2,2 a 2 617, 
b 0 31%, 
3,2 a —2 -2 -2 3 3 30 | T. 
b —2 0 2 -1 1 10 | T 
ab 1 0-1 —2 2 10.1 Tas 
q 1-2 1 0 0 6|T, 
4,3 a -3 -3 -3 -3 84 
b -3 -1 1 3 —2 0 2 28 | T. 
ab 3 -5 O 5 70 | Tab 
q 3-3-3 3 2-4 2 60 | T, 
aq -1 1 1-1 1-2 1 10 | Tae 
5,4 a —4 -—4 -4 -4 -4 5 &§ 56 & 180 | T. 
b -4-2 0 2 4 -3 -1 1 3 60 | 7', 
ab 2 1 0-1 -2 -1 1 3 30 Ts 
q | 2-1-2-1 2 1-1-1 1 1617, 
aq -4 2 4 2-4 7-7-7 7 252 | Tee 
6,5 a | -5 -5 -5 -5 -5 -5 6 6 6 6 6) 
b -5 -3 -1 1 38 5|-4-2 0 2 
ab 10 6 2-2 -6 -10/-14 -—7 O 7 770 | Ta 
q 10 -—2 -—8 -2 10 6 -3 -6 -3 6] 462/T, 
aq -5 1 4 4 1 8 -8 -—4 8 | 308 | T., 


CoMPUTATIONAL CONSTANTS 


For Equation | Constant Value in design 
computing 


2,1 3,2 4,3 5,4 6,5 


M’ (16) c 1/2 5/6 7/6 3/2 11/6 
L (18) c’ 3/4 25/12 49/12 27/4 121/12 
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computed as a 3, 3 assay, a significant departure from parallelism 
(P < .05) could be traced to a deficient response at the highest dose 
of the Unknown. With this one omitted, the remaining five doses 
gave the satisfactorily parallel 3, 2 assay in Table 3. 


TABLE 3 

A 3, 2 Assay 1n GuINEA Pics oF THE ANTISCORBUTIC ACTIVITY OF ORANGE JUICE 
(U) Rewative to Pure Ascorric Acip (S) 

Response y = length of the odontoblasts in the incisors. The y at Us in the original 
3,3 assay (7', = 260.6) gave a significant departure from parallelism and has been 
omitted in recomputing the potency. (Bliss (1952b)) 


| 


Term | Coefficients x for dose ei = T; Variance 
Si Ss OF U: > 2? T?/ fe; 
a 

| -2 oO 2 -1 1} 10 456.7 = 

ab | 1 0-1 -2 2| 10 9.4=T, 8.84 

q 1 -2 1 0 O| 6 7.4=T, 9.13 

T, | 79.8 166.5 260.6 131.9 227.0 Error (n = 45), s? 14.1080 
| 


i = log 2 = .30103, c = 5/6, M’ = .03455 (Eq. (16)) 


= 0, fy = —.15052, M = .18507 (10)), f = 10 
@ = 4.057, C = 1.02822 (Eq. (17)), ci? = .18879 
3L = .07323 (Eq. (18)) 


Factorial assays with unequally-spaced log-doses 


When equally-spaced log-doses are not convenient, as in some micro- 
bial assays, factorial coefficients can be developed for other sequences. 
The more evenly the selected log-doses cover the range, the simpler 
are the coefficients. The linear coefficients x, are small whole numbers 
with approximately the same spacing as the log-doses x and an average 
ratio between successive values of 7’ = > {2x,(x — #)}/ > zi. Any 
given set of three or more linear coefficients x, can be matched with 
orthogonal quadratic coefficients x, for testing curvature, as described 
elsewhere (Bliss and Calhoun [1954]). 

This design has been applied to microbial assays for the vitamins 
(Bliss [1956b]). With test solutions of a suitable concentration and 
dosages of 1.5, 2, 3 and 4 ml. per tube, the turbidimetric response to . 
vitamin B,, , for example, in terms either of (100% transmittance) or 
its logarithm, has usually defined a straight line when plotted against 
the log-dose. The corresponding linear coefficients are x, = —29, 
— 12, 12 and 29, and the quadratic coefficients are z, = 1, —1, —land1. 
The constants in the equations for computing M’ and L are derived as 
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before; for the above: sequence of four doses they are ci = 2i’e,/e, = 


7.2332 and ca” = (ci)*e./e, = 0.10623. Similar coefficients have been 
determined for other series of 3 to 6 doses. 


Assays with more than one Unknown 


- Two or more Unknowns may be assayed concurrently against the 
same Standard at equivalent dosage levels. Although the potency of 
each Unknown and its confidence interval can be determined inde- 
pendently, this both increases the amount of computing and sacrifices 
some of the precision inherent in the data. The Standard and the several 
Unknowns commonly will have linear log-dose response curves with 
nearly the same slope and the same error variance about these lines. 
Any questionable discrepancies can be tested for significance and the 
discordant preparations omitted. With more degrees of freedom, the 
assay variance (s’) and slope (b) have greater stability, and C will be 
computed with a smaller /’. Confidence intervals determined with 
these composite values, or their equivalent, average less than if based 
upon only part of the relevant data. 

The assay slope is computed by Equation (11), which for convenience 
may be written as 


b=2 > Tike LUM’ (19) 


where T; = = (x,T,) for each-preparation, 2’ is the dosage interval in 
logarithms (x) corresponding to a unit difference in x, , and > f is the 
sum over all preparations of the number of observations at a single 
dosage level. By analogy with Equation (16), the variation in y attri- 
butable to this 6 is 


Ls 
If treatments have been assigned to the test animals or other units 
entirely at random, the error variance for the assay may be computed 
by Equation (13) from the variation within treatments. Alternatively, 


the variation of treatment means about the fitted parallel lines may be 
included in the error, by calculating 


{Ly Lir’/kf) — (20) 


For each preparation, T’ = >> T, , summed over k dosage levels; the 
degrees of freedom n = N — h. Given B’ and s’, the slope factor C is 
computed with Equation (14). 

The number of responses f at each dose may be constant or may 
vary from preparation to preparation. If f is constant, M’ and L are 
computed with Equation (16) modified to 


M’ = cih’T,/2 >, T: (21) 
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and with Equation (18) adjusted to 
L = 2V(C — 1(CM” (22) 


where h’ = h + 1 is the number of preparations. In an assay with one 
Unknown, h’ = 2, >> Tf = T, and Equations (21) and (22) reduce to 
their original form. If f is not the same for all preparations, M’ may 
be computed by Equation (6) with b from Equation (19). Z is caleu- 
lated by Equations (9) and (15) with >> [x7] = ie, >> f/2 = 
i. 

Most microbial assays include several Unknowns as in the random- 
ized vitamin B,, assay in Table 4. The reading for each tube has been 


TABLE 4 
A Tursipimetric Assay IN THREE RANDOMIZED SETs OF THE VITAMIN Bj. ACTIVITY 
or THREE Unknowns (U, U’ anp U”) at Four DosaceE LEVELS, IN ML. OF 
Test SOLUTION PER TUBE 
Data from the Food Research Laboratories in a U.S.P. collaborative study, Decem- 
ber, 1954. 


A: Individual responses, y = 100 log (100 —% transmittance) — 100 


S for dose U for dose U’ for dose U” for dose Total 


62 8 445 3 4114.5 2 3 2 3 


I 43 48 54 62) 45 45 54 62) 41 46 56 59) 41 46 51 63) 816 
II | 38 49 56 62) 43 46 56 63) 40 45 53 61) 40 48 52 60) 812 
III | 38 45 54 61| 41 46 54 60) 34 45 51 54) 40 48 52 60) 783 


T, j119 142 164 185/129 137 164 185) 115 136 160 174) 121 142 155 183) 2411 


B: Calculation of M’ and 31 


Treatment tocal 7’, for dose | Total Laz: M' 4L 
Prepn. 7 Ta \(Eq. (21))|\(Eq. (22)) 
15 2 4 
S 119 142 164 185 610 | 2178 —2 
U 129 137 164 = 185 615 | 1948 13) 5) .00895 | .0443 
115 136 160 174 585 | 1999 —7|—25|—.04477 | .0445 
121 142 155 183 601 | 1954 —9|/—.01612 .0443 


Sum 484 557 643 727 | 2411 | 8079 11) M’ = .0017906 7. 


—12 12 29 | ci = 7.2332, h’ = 4, = .21246 
| 1 | s? = 5.2189, n = 11, = 4.844, C = 1.00924 
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coded to obtain the individual responses y. If the assay had not been 
randomized, as is often the case, the error variance would be based upon 
the variation of the T,’s about parallel, straight, log-dose response 
curves and this procedure has been followed here. In consequence, 
s* has been computed with >> y’ in Equation (20) replaced by 2. ys 
and with n = hk — 1 = 11 degrees of freedom. The coefficients z, 
and z, were determined by the unequal spacing of the log-doses. The 
values of T% = )-> (x.7',) indicated no consistent simple or quadratic 
curvature over the four preparations. The log-relative potency M’ 
and its half-confidence interval (}L) have been computed for each 
Unknown with Equations (21) and (22). 


Assays in balanced pairs 


When responses can be arranged in homogeneous pairs, the most 
efficient assay is a two-dose factorial twin cross-over design (Smith 
et al [1944]). Half of the pairs are given doses S, and U, and the other 
half S, and U,. Since a time or order sequence is usually involved, as in 
the insulin assay, the order of injection is reversed in half of the animals 
receiving each pair of doses, so that all four treatments are given on 
each test day. Four equal groups are injected in the following order: 


yroup or pair 
Item 

1 2 3 4 
lst dose Si U2 
2nd dose U; Us Si 

Difference for each y U.-S 
Response variate, y Yi Yo Ys Ys 
Total of f response, 7’; Ti T: 
Factorial S vs. U, 2a | -1 1 1 —1 
coefficients Slope, x, 1 1 1 
for Days 1 
Residual -1 —1 1 1 


The unit of response y in a twin cross-over assay is the difference 
between the two paired reactions, within each pair subtracting the 
reaction to the smaller dose from the reaction to the larger dose. Any 
losses during the assay are adjusted before totalling the y’s in each 
group to obtain T, = 7, to T,. From the definitions in Table 1 for 
a2, 2design, T, = —7T,+7,+ 7; — T,andT, = 7,+7.+T7; + 174; 
from Equation (16) the log-relative potency VW’ = iT,/T, . The 
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contrasts for the remaining two degrees of freedom among the four 
totals T, to T, measure directly the change in average sensitivity from 
the first to the second dose, and indirectly the deviation from parallel- 
ism (Finney [1956]). 

The error variance s” for computing the confidence interval depends 
upon the design of the assay. If each y represents a different animal, 
as in an insulin assay, and all are assigned at random but in equal 
numbers to each group and tested concurrently, the error variance of a 
single y is computed with Equation (13), where 7, = 7, to T,. In caleu- 
lating the confidence interval, C is determined with Equation (17) and 
e, = 4, and L with Equation (18) and c’ = 1. 


PARALLEL-LINE ASSAYS WITH POTENTIAL COVARIANCE 


In most but not all parallel-line assays, the design insures that the 
numerator of M’ will be independent of its denominator. Three con- 
ditions will be considered which may give rise to covariance in parallel- 
line assays. In contrast, covariance between numerator and denomina- 
tor is always present in slope-ratio assays. 


Assays in randomized sets 


Not infrequently, the responding units in an assay can be arranged 
in advance of treatment into relatively homogeneous groups or sets, 
such as of litter mates in a vitamin D assay (Bliss and Gyorgy [1951]) 
or repeated doses of drug in the same patient in a clinical assay (Bliss, 
[1952a]). Successive smooth muscle contractions in a histamine assay 
(Schild [1942]) have been treated similarly, although these involve no 
inherent discontinuity which coincides with the subdivision into sets 
and alternative adjustments for changes in sensitivity may be more 
effective (Finney [1956]). Each set contains as many units as there are 
treatments and within each set one unit is allotted at random to each 
treatment. In effect, the Unknown is assayed separately in each set, 
and a separate M’ could be computed for each if required. 

The error variance s° is usually determined from the interaction of 
the k treatments by the f sets as 


— Le + T’/kf)/n (23) 
where 7’, is the treatment total, T, the set total, T= >> T, = > T., 
and n = (k — 1)(f — 1). Finney [1952c] has noted, however, that this 
interaction could contain variance components which increase the 
apparent error of an assayed potency but not its real error. 


His reasoning may be summarized geometrically. In an analytical 
| assay, the potency of the test solution of an Unknown relative to that 
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of the Standard is presumably the same in all sets, so that the expected 
value of. M’ is constant. If two parallel log-dose response lines are 
fitted to the data of each set, the expected distance between them on 
the x axis (M’) is also constant. However, sets that are unequally 
sensitive to the drug may differ significantly in the slopes of these lines. 
If the horizontal distance M’ is to remain constant, a steeper slope (b) 
necessarily increases the vertical distance (a), leading to a positive 
correlation between the numerator and denominator of M’ = a/b. 
A significant variance component for slope would enlarge the inter- 
action of treatments by sets in Equation (23) but would not affect the 
precision of the assay. 

The presence of this effect is easy to test in a factorial assay. The 
responses y in each randomized set are multiplied by the factorial 
coefficients for the numerator (x,) of M’ and by those for its denominator 
(z,). The sums of these products are the paired values y, = >> (x.y) 
and y, = >, (xy), respectively, from which y, = T,and y, = T, 
in Equation (16). From the f values of y, and y, , the variances and 
covariance are 


= (do y2 — T2/f)/ne. 
Cov (yy) = (Do yoys — T.Ts/f)/nV ects (24) 
Viy) = — T3/f)/nes 


where n = f — l,e, = >, 22, ande, = >, 2}. The two variances, 
V(y.) and V(y,), have different expectations but both contain the 
postulated variance component for slope in addition to the true random 
sampling error, as does the covariance Cov (y.¥). 

The Cov (y,y,) can be tested for significance in terms of the cor- 
relation coefficient r = Cov (y.y,)/V(V(y.) V(ys). If significant, the 
confidence limits Xy, may still be determined with Equations (18) or 
(22) by computing C with a corrected error variance é°. To estimate 
é° the variances V(y,) and V(y,) in Equation (24) are removed from 
the usual error variance s° in Equation (23), giving 


= {(f — — (Ff — DIVYY.) + 


with n = (f — 1)(k — 3) degrees of freedom. If the slope varies sig- 
nificantly from group to group, é° should be smaller than s’. 

In some assays, the covariance in Equation (24) has proved significant 
(Leech and Grundy [1953]); in others, non-significant and negligible 
(Bliss [1952a,b]). One or both variances, V(y,) and V(y,), may be 
smaller than 6’, which would favor computing C with the simpler error 


variance in Equation (23). In an assay with several Unknowns, such. 


as in Table 4, the interaction of sets by preparations and of sets by 
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slope may be compared with the remaining interactions of sets by 
treatments, most conveniently by an analysis of variance (Table 5). 
If the mean squares for the two doubtful interactions were to exceed 
that for the remainder, either significantly or with an F approaching 
significance, é? would be more appropriate for computing C. The 
log-dose response curves for the several preparations may be tested for 


parallelism and for linearity as part of the analysis. 


TABLE 5 
ANALYSIS OF VARIANCE OF THE MICROBIAL ASSAY IN TABLE 4 
Row Source D.f. | Sum of squares | Mean square F 
1 | Among sets 2 40.542 20.271 
2 | Among treatments (15) (2861 . 812) 
3 | (1) Preparations 3 43.396 14.465 | (a) 3.65 
4 | (2) Assay slope 1 2761 .008 2761 .008 
5 | (3) Preparations X slope 3 5.913 1.971 (b) 1.25 
6 | (4) Non-linearity 8 51.495 6.437 | © 2.29 
7 | Sets X treatments (30) (82.125) (2.7375) 
8 | Sets X (1) 6 23.792 3.965 | (dd) 1.60 
9 | Sets X (2) 2 3.950 1.975 | (d) .80 
10 | Sets X (3) 6 9.448 1.575 — 
11 | Sets X (4) 16 44.935 2.808 — 
12 | Sets X (8) and (4) 22 54.383 2.472 — 


F values reported are computed with denominator mean squares from the following rows: (a), 
row 8; (6), row 10; (c), row 11; (d), row 12, 


In Table 5, neither the mean square for non-parallelism (row 5) 
nor that for curvature (row 6) exceeded significantly the corresponding 
mean square interaction with sets. When compared with the remaining 
interaction mean squares (row 12), those for sets with preparations 
(row 8) and with slope (row 9) did not approach significance. Hence, 
C could be determined with the overall assay error of s* = 2.7375 with 
n = 30, from row 7 or computed by Equation (23). Half-confidence 
intervals, recomputed for the three Unknowns with the new value of 
C = 1.00415, ranged from 3Z = 0.0297 to 0.0298. 


Losses in balanced assays 


A balanced factorial assay has an equal number of responses at all 
dosage levels of each preparation. A loss during the assay must be 
repaired before the log-potency or its confidence interval can be com- 
puted with the equations for a balanced design. If f animals have been 
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assigned to each treatment group at random, a missing value may be 
replaced by the mean of the (f — 1) responses in the incomplete group 
and its treatment total 7, recomputed. Hf the assay consists of ran- 
domized groups or Latin squares, the missing value is replaced by one 
computed by the appropriate formula (Bliss [1952b]; Finney [1952b)). 
With these replacements, Equation (16) gives an unbiassed estimate 
(M’) of the log-relative potency, but its confidence interval needs 
further adjustment. 

The principal corrections are in the slope factor C. For every 
replacement, the assay variance s° loses one degree of freedom, which, 
in turn, increases ?’. Less obviously, a missing value increases the 
variance of 7’, , if the treatment total 7, containing the replacement 
enters into its calculation. In effect, each treatment total is the treat- 
ment mean multiplied by f, ie., T, = f(T./f). If T, contains a re- 
placement, it is equal instead to f(T!/f’), where T/ and f’ are the 
initial total and frequency. As a consequence, the variance of 7’, is 
f? © (@/f’)s’, and C is increased. To avoid bias, it should be computed 
as 


(26) 


The larger the f, the smaller is the bias which this corrects. In a 
two-dose assay with a single Unknown and one replacement, for example, 
omitting the adjustment in Equation (26) would underestimate the last 
term in the denominator of C by the following percentages: 


f in complete groups 3 4 5 6 7 8 9 
Percent underestimate Ee | 7.7 5.9 4.8 4.0 3.4 3.0 


A missing value has less effect upon the variance of T, , which then 
becomes f” >> (x2/f’)s”. In ‘a two-dose assay, each x2 = x} , and the 
ratio v,./Vs. is identical with that for a complete assay. In all other 
factorial designs, c’ in Equation 18, or c’h’/2 in Equation 22, may be 
replaced by >> (x2/f’)/>. (z3/f’). Here the bias may be either 
positive or negative but is usually neglected when f > 5. If a single 
value is lost from a three-dose assay of a single Unknown with f = 5, 
for example, neglecting the correction overestimates L at most by 1.4 
percent, if the loss is from an end dose, or underestimates it by 2.0 
percent, if the loss is from a mid-dose. 

In a factorial assay with a replacement, the numerator and denomina- 
tor of M’ may no longer be independent. The covariance of T, and T;, is 


Cov. (T.T:) = 2. f 
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from which K in Equation (8) is computed as 


K = (C—1) (27) 


The losses may be so balanced that >> (z,2,/f’) = 0 as in a complete 
assay, so that the simpler Equation (9) applies. Even when not zero, 
> (x.7,/f’) is usually negligible. 

These corrections for lost readings have been examined for their 
effect upon a factorial assay of insulin with 6 y’s in groups 2 and 3, 
and 5 y’s in groups 1 and 4. The missing value in each of the latter 
two groups was replaced with the mean of the remaining observations 
before computing the log-relative potency M’ = —0.03522. The error 
variance with 18 d.f. was s’ = 214.08. After correcting for replacements 
by Equation (26), C = 1.03767, instead of C = 1.03413 by Equation 
(17) with the same s*t’. The confidence interval computed with the 
corrected C but without covariance was L = 0.1177. With K = 
—0.003424 by Equation (27), the confidence interval corrected for 
covariance was L = 0.1135, a difference of less than 4 percent. . 


The moving average adjustment for changing sensitivity* 


Some assays depend upon successive reactions of a single animal or 
piece of tissue, such as the fall in blood pressure of an anaesthetized 
chicken following a dose of posterior pituitary extract. The reaction 
depends not only upon the dose of drug but also upon the condition of 
the animal preparation, which may change slowly or rapidly. The 
four doses in a 2, 2 factorial assay may be administered in randomized 
sets (Schild [1942]), or by pairing the treatments, as in the U.X.P. XV 
assay for vasopressin. Recently, Finney [1956] has proposed still 
another approach. 

A rather different design has been adopted for the posterior pituitary | 
assay in U.S.P. XV. A single dosage level of the Standard (S) is in- 
jected at regular intervals, alternating with injections in a random order 
of a higher or a lower dosage leve! of the Unknown (U, and U,). An 
assay consists of f = 3 or more sets or pairs of U, and U, and the 
alternating one-level doses of Standard. If the sensitivity of the 
animal should change markedly, all doses following one of the Standard 
may be increased (or decreased) by a constant proportion, starting 
again with the Standard at its new level. 

For analysis, the successive reactions to the Standard are averaged 
in pairs to form a series of moving averages. Tach average is subtracted 
from the intervening reaction to the Unknown to obtain a unit response, 


*Referee comments have aided materially the revision of this section. 
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designated. as y, for a low dose of Unknown and as y, for a high dose. 
Corresponding values of x are obtained similarly from the log-doses; 
they average zero (€ = 0) if S is exactly midway between U, and U, 
on a logarithmic scale. In computing M’, jy — js = (T; + T.)/2f 
and b = (T, — T,)/tf, from which 


M’ = {i(T; + T,)/2(T, — T;)} (28) 


where 7, = Y2 , and 7 is the constant log-interval 
between U, and U, . 

The error variance of an assay is computed from the y (the y, 
and y2) as 


(Ti (29) 


where n = 2(f — 1). Each y, however, is the difference between the 
reaction to a dose of Unknown and the mean of two adjacent reactions 
to the constant Standard. Assuming that the initial reactions are inde- 
pendent and have a common variance s; , s’ = (1 + 3)s; or s; = 2s"/3, 
if we neglect the covariance between the y introduced by the moving 
average for the Standard. This covariance depends upon the number 
of times (r) the order of y, and y, is reversed in going from one set to 
the next. With no reversals (r = 0) it is zero and s; is unbiassed; with 
f = 3or4 andr = 2, the covariance exerts its greatest effect, and s” 
would be multiplied by 17/18 for an unbiassed estimate of s; . Hence, 
for all practical purposes, the estimates from Equation (29) have 
negligible bias. 

The confidence interval for M’ depends upon the variance and co- 
variance of 7, + T, and T, — T,. These can be expressed as a product 
of s; and a linear combination of the squares or products of the coeffi- 
cients 1 or } with which the original reactions are combined into T, + T, 
and T, — T,. For T, + T, or the numerator, these coefficients a, 
total zero and >> a2 = 4f — 4g, where g is equal to 1 plus the number 
of overall changes in dosage level during the assay. For T, — 7, or 
the denominator, the coefficients a, again total zero and = az = 2f + 
r + 39, where r is the number of reversals between overall changes in 
dosage. For their covariance, >> (a,a,) depends upon r and g; in the 
dosage patterns most likely to occur in practice, >. (a,a,) varies from 
—} to}. 

The confidence interval may be computed from Equation (8) with 


C = (T, — T,)*/{(T2 — — (4f + 2r + 
= (8f — g)/4(4f + 2r + g) 
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and 


K = (C — 1) © (a,a,)/2(4f + 2r + g) 


K is usually so small that it may be omitted and the confidence interval 
L calculated with Equation (9). 

To avoid computing a separate adjustment for each individual 
sequence, a second-order correction, a single pattern has been postu- 
lated for the routine assays in U.S.P. XV. On the assumption of one 
reversal in the order of y, and y, (r = 1), one change in the overall 
dosage level (g = 2), and negligible covariance, C reduces to 


C = (T. — T,)’/{(T2 — — Af + (30) 
and the ratio of the two variances in Equation (18) to 
= (4f — + 1) (31) 


These equations have been checked empirically by experimental 
sampling. With the averages from six assays in two laboratories as a 
guide, a hypothetical assay of posterior pituitary injection was con- 
structed. It was assigned an assumed slope of b = 37.5, a log-interval 
between U, and U, of i = log (1.5’) = 0.352, and a standard deviation 
for the initial reactions of s; = 1.5. Each mock assay consisted of 
three sets of doses (f = 3), had a true log-relative potency of u = 0.1, 
and required no overall change in dosage level. The reactions, 1 to 13, 
were assumed to decrease uniformly by steps of 1 millimeter from 34 
to 22. As reactions to the Standard, the odd-numbered values were 
unchanged; half of the even-numbered values, minus 2.85, were desig- 
nated as reactions to U, , and the remaining even-numbered values, 
plus 10.35, as reactions to U, . 

Three dosage orders were constructed, one with no reversal in the 
order of U, and U, , a second with one reversal (U,U,U,U,U,0,), and 
a third with two reversals (U,U,U,U,U,U,.). The random element 
was then added from a table of random deviates for a normal population 
with ¢ = 1 and mean = 0 (Dixon and Massey [1951]), multiplying 
each random deviate, either + or —, by 1.5 before adding it to the 
expected first to thirteenth reaction’ This process was repeated 30 
times for each of the three designs. Finally, each was analyzed as if 
it were an actual posterior pituitary assay, leading to an “assayed” 
potency M’ and error variance s’ by Equations (28) and (29), and to 
an approximate confidence interval based upon the “standardized” 
pattern by Equations (30), (31) and (18). 

The variability resulting from this model, with only one source of 
random variation and no residual effect of preceding doses, is summarized 
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TABLE 6 
Resutts or 30 Mock Assays ror Eacu oF THREE DosaGeE SEQUENCES OF A 
PostTeriorR Pirurtary INJEcTION, (THE ““UNKNOWN”’) Havine a “TRUE” 
Loa-PoTency OF » = 0.10 


Number of, ‘“Assayed” AI’ Average variance | Interval L from | Frequency 
reversals, of » outside 
r Within Between X ye 
Mean Range Observed [Expected] assays assays 
0 .1024 .0484, .1578) 3.395 3.375 .1574 = .1514 2 
4 1 .0992 .0515, .1423) 2.893 3.281 1420 =.1258 0 
4 2 1019 .0577, .1765) 3.063 3.188 | .1522 .1841 2 
4 
Combined |.1012 .0484, .1765) 3.117 3.281 -1507 =. 1556 4 
: for each design in Table 6. The mean of the “observed” M’ closely 


; . approximated the true value of » = 0.1 in each series of 30, but its 
= range was considerable. The ‘observed’ error variances differed but 

little from 3s;/2 = 3.375, and even less from expectations that had been 
7 corrected for the covariance introduced by the moving average (Table 
6). The average confidence interval, vs L’/30, based upon the 


variation within the individual assays, agreed satisfactorily with that 
j computed from the variation of the 30 values of M’ between assays, 
. 2ts,,. The single pattern postulated in Equations (30) and (31) gave a 


slightly smaller expected confidence interval of 0.1442, when solved 
with the values adopted in setting up the mock assays. None of these 
4 statistics differed significantly from their expectations. Four of the 
90 “observed” confidence intervals did not bracket the true value of 
uw = 0.1, in good agreement with the postulated number of 4.5. So 
far as can be judged from these mock assays, log-potencies and confidence 
intervals computed with the simplified equations should meet all 
practical needs. 


‘6 THE CONFIDENCE INTERVAL FOR ALL-OR-NONE ASSAYS 


In an “all-or-none”’ reaction, each individual reacts or fails to react; 
the extent of its reaction is not measured. Whether the animal reacts 
or not depends upon the relation between its threshold and the dose at 
the moment of testing. If the dose is larger than its threshold, the 
animal reacts; if less, it does not react. A group of animals (or plants) 
is tested at each of several dosages that give positive reactions between 
0 and 100 percent. When these percentages are plotted against the 


4k 
t 
4 
‘ 


—o Oo 


CONFIDENCE LIMITS 513 


log-dose, they usually define a symmetrical, sigmoid curve resembling 
the cumulative normal, as if the logarithm of the individual threshold 
dose in the original population were normally distributed. 

To test this hypothesis, each percentage reaction may be trans- 
formed to the corresponding normal equivalent deviate (N.E:D.) or 
probit (N.E.D. + 5) and plotted against the log-dose. If the data are 
consistent with the above interpretation, the resulting curve will approxi- 
mate a straight line and the variance of each probit will be a function 
of its expected response. To take full advantage of the information 
supplied by each group, the log-dose response line may be computed by 
maximum likelihood with working probits and weights (Bliss [1952b]; 
Finney [1952a]). In experiments based upon this relation, the confidence 
interval for the log-relative potency and for other statistics can be 
computed with Equation (9) or its equivalent, as will be described 
presently. 

All-or-none data may be analyzed with two other transformations 
of the percentage response, the logit and the inverse sine. The logit is 
based upon the logistic curve, which closely approximates the normal 
from about 10 to 90 percent response, but departs increasingly at the 
ends. It requires weighting coefficients and presents computational 
problems similar to those found in probit analysis. The procedure and 
its logical basis have been developed by Berkson [1953] to whose papers 
the reader is referred. The second alternative, the inverse sine or angular 
transformation, is an equally good approximation to the normal over 
the same range as the logit (Claringbold et al [1953]; Knudsen and 
Curtis [1947]). It is justified by statistical convenience rather than by 
a biological model. If the groups are equal in size, the information in 
each angle is independent of its observed or expected value, and hence 
weighting coefficients are unnecessary. In consequence, an assay can 
be designed and analyzed factorially, with little change from a graded- 
response assay. If responses of 0 or 100 percent are at all frequent, or 


if interest centers in the ends of the response range, probit analysis 
would be preferred. 


Analysis in angles 


The angular transformation will be considered here in relation to 
factorial assays, although it can be extended readily to less well-designed 
experiments. A constant number of test animals (n’) is assigned to 
each of the k = 2 or more doses of the Standard and of the Unknown, 
spaced equally in log-units. Doses are selected that will produce 
similar reactions within the range from 10 to 90 percent, so far as this 
can be predicted in advance. 


7 
> 
q 
“Thy, 
ot; 
ts 
at 
he 
ts) : 
en 
the 
i 


514 BIOMETRICS, DECEMBER 1956 


For analysis, each percentage reaction in the intermediate zone is 
converted to its empirical equivalent angle y, which extends from 
0° to 90° for percentages of 0 to 100 (Bliss and Calhoun [1954]; Finney 
[1952b]; Fisher and Yates [1953]). A reaction of 0 (or 100) percent dis- 
torts the curve less, however, if its empirical angle is replaced by a 
working angle, as illustrated below. Each such estimate is a function of 
its expected value, which depends, in turn, upon observations in the inter- 
mediate zone. When these expectations are less than 0° or more than 
90°, observed percentages of 0 and 100 contain no usable information 
and are omitted. For a maximum likelihood solution, all empirical 
angles would be replaced by working angles in successive iterations, 
but in view of the empirical basis of the transformation this refinement 
is here of doubtful value. 

The subsequent calculation is very similar to that for a graded- 
response factorial assay. Each angle y is multiplied by its corresponding 
factorial coefficients x, or x, , and the products summed to obtain 
T. = >, (x,y) and T, = >> (x,y), from which the log-relative potency 
M’ is computed by Equation (16). 

The error variance of y has the expected value o” = 820.7/n’, 
where n’ is the number of individuals per group and is the same in 
all groups. If n’ were to vary between groups by less than 10 or 20 
percent, its harmonic mean could be used to compute an acceptable 
o°. Dividing a sum of squared deviations by o’ converts it to x’ for 
testing agreement with the assumptions underlying an assay. The 
observed variation of the y’s about the fitted parallel lines totals 


with n = >> k — 3 degrees of freedom, from which an approximate 
x’ = [y’]/o”. If x’ is not significant or near significant, the slope factor 
C is computed by Equation (17), substituting o” for s’, and with f = 1 
and # = 3.841. Given C and M’, no other change is required in com- 
puting L by Equation (18). However, if x’ is significant or approaches 
significance, the dosage-response curves for the Standard and the Un- 
known are tested for divergence in slope and for single curvature. 
T, = >> x,y is computed with the factorial coefficients in rows i = ab, q 
and aq of Table 1 or 2, to obtain for each row x’ = T%/e,o” with one 
degree of freedom. If these explain the apparent heterogeneity, so 
that the residual x? = [y7]/o? — >> (T7%/e,o") is not significant, the 
validity of the assay is in doubt. Otherwise, the heterogeneity may 
reflect departures from random binomial variation that are not critical. 
These might occur, for example, if the individual test animals were not 
assigned to treatment groups entirely at random, although the treatment 


Ha 
' 
ik 
‘4 
2 


CONFIDENCE LIMITS 515 
groups were later assigned to doses at random. C may then be computed 
by Equation (17) with the empirical error variance s’ = [y’]/n, Stu- 
dent’s ¢ for n = > k — 3 degrees of freedom, and f = 1. 

The calculation with angles may be illustrated by an assay of the 
relative toxicity of two insecticides, nicotine and lindane, in the adult 
milkweed bug, Oncopeltus fasciatus (Turner {1955]). Adult bugs of 
the same age were anaesthetized by carbon dioxide, injected individually 
with insecticide, and held for 48 hours before determining mortality. 
From earlier experiments, the Unknown (lindane) was assumed to be 
40 times as toxic as the Standard (nicotine), and on this basis the two 
toxicants were injected at six corresponding dosage levels, varying in 
concentration by a constant factor of V2. 24 bugs were injected at 
each dosage level of each toxicant in concomitant tests, leading to the 
percentage kills in Table 7. 


TABLE 7 


ANALysIs IN ANGLES OF A BALANCED ALL-OR-NONE ASSAY OF THE TOXICITY OF 
LInDANE ReEwativE TO NIcoTINE WHEN INJECTED INTO ADULT MILKWEED 


Bucs 1n Groups oF n’ = 24 (TuRNER [1955)) 
Coded log-dose for Percent kill Kill in angles, y 
5 doses 6 doses Nicotine Lindane Nicotine Lindane 
z’ S U S U 
—2 —5 8.3 16.7 16.7 24.1 
-1 —3 20.8 25.0 27.1 30.0 
0 -1 37.5 62.5 37.8 52.2 
1 1 62.5 79.2 52.2 62.9 
2 3 83.3 83.3 65.9 65.9 
(3) 5 100 100 86.5* 


*Maximum likelihood estimate (from Fisher and Yates [1953] Table XIV) with 
expected angle ¢ computed as ¢ = 7 + b’zx’, where b’ = T,/e, = 12.0, x’ = 3, 
and g = 39.94 for S and 47.0 for U, giving ¢s = 75.94 and gg = 83.02. 


i = .1505, Zs — fy = 1.602, o? = 820.7/24 = 34.196, @ = 3.841 


5-dose assay: T, = 35.4, T, = 240.0, c = 4, M’ = .08880 (Eq. (16)), M = 1.6908 
(Eq. (10)); [y*] = 104.24 (Eq. (32)), x? = [y*]/o? = 3.05, n = 7; C = 1.04779 
(Eo. (17)), ca? = .18120, Xy- = .09304 + .09515 (Eq. (9)), Xap = 1.5999, 1.7902 


6-dose assay: T, = 38.8, T, = 893.2, c = 35/3, M’ = .07627 (Eq. (16)), M = 1.6783 
(Eq. (10)); [y*] = 136.84 (Eq. (32)), x? = 4.00, n = 9; C = 1.02359 (Eq. (17)), 
ca? = .26425, Xy- = .07807 + .07984 (Eq. (9)) Xz = 1.6002, 1.7599 


Since the largest dose of each toxicant killed 100 percent of the bugs, 
relative toxicity was first computed from the five smaller doses. Apply- 
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ing the five-dose factorial coefficients in Table 1 gave the sums of 
products 7, and 7, , leading to the log-potency 17. From x’, the ob- 
servations agreed very well with the fitted parallel lines (P = 0.88), 
well within the requirements for assay validity and homogeneity. 
Hence C was computed with the expected variance oc” = 34.196, leading 
to the log-confidence interval given in the table. 

The assay was then recomputed from all six dosage levels, w ith 
provisional maximum likelihood estimates for the two kills of 100 
percent. From the combined slope based upon the five lower doses for 
both toxicants of b = 240.0/20 = 12.0, the expected response at the 
highest dose (z’ = 3) was ¢s = 39.94 + 12.0 X 3 = 75.94 for the 
Standard and ¢y = 83.02 for the Unknown. With these “expected 
values” and a table. of the maximal working angle (Bliss and Calhoun 
{1952]; Finney [1926b]; Fisher and Yates [1953]), the required angles 
were y = 83.1 for the Standard and y = 86.5 for the Unknown. When 
al eee with the six-dose factorial coefficients from Table 1, x’ 
again indicated satisfactory agreement with hypothesis. Including the 
100 percent kills changed the estimated log-relative potency by 14 
percent and shortened the confidence interval by 16 percent. Whether 
these new estimates could be considered an improvement over those 
limited to the intermediate kills would be judged from their agreement 
with probit analysis, as described in the next section. 


Analisis in probits 


Because they are based directly upon the model, probits are additive, 
a characteristic which takes precedence over equality of the variance. 
Dosages can be selected from the zone of most interest to the experi- 
menter, although the slope has its greatest precision when the response 
ranges from 7-10 to 90-93 percent. Each percentage reaction between 
0 and 100 percent is transformed initially to its empirical probit y’ 
and plotted against the log-dose x. The responses of 0 and 100 percent, 
which are — © and + © in empirical probits, may be indicated by small 
vertical urrows. 

For determining the “expected” probit Y at each dose, provisional 
dosage-effect curves are usually drawn by inspection, which limits 
their precision. If the doses are spaced at equal log-intervals and coded, 
parallel unweighted curves can be computed easily and objectively, 
initially from the empirical probits in the intermediate zone. For each 
reaction of 0 or 100 percent, a temporary ‘‘expected” probit Y is calcu- 
lated from the initial equation, and with this Y a preliminary working 
probit. The provisional curves are then recomputed, including these 
new values but again without weighting, and solved for each coded 
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log-dose x’ to obtain the expected probits Y required for an estimate 
by maximum likelihood. Even with these precautions, the y’s are still 
provisional estimates, to be replaced in successive weighted approxima- 
tions which converge to the maximum likelihood solution. By providing 
a better starting point, however, they should reduce the number of 
iterations required for an acceptable precision. 

Given the expected probit Y for each group of n’ individuals, its 
working probit y = (Y — P/Z) + p/Z and weight w = n’(Z’/PQ) 
are determined with the aid of suitable tables (Finney [1952a]; Finney 
and Stevens [1946]; Fisher and Yates [1953]). The combined slope of 
the best-fitting set of parallel lines is computed in terms of 2’, y and w as 


where [wx’?] = — w and [wr’y] = (wr'y) — 
7. (wx’) >> (wy)/>> w for each curve. The extent of any divergence in 
slope can be measured by x, , determined as 


B (34) 
where = [wz'y]?/[wz’’] for each curve and B? = [wr”]. 
The slope b’, the difference in the weighted mean responses, Jy — gs , 
and the weighted mean coded log-doses €{ and #% are substituted in 
the equation for the log-relative potency to obtain 


M’ = — + Ju — Gs)/b'} (35) 
where 7’ is the interval in logarithms corresponding to a unit difference 
in x’. Unlike the balanced factorial assay with equally-weighted y’s, 
the difference #{ — #% is seldom exactly zero. M’ is converted to a 
log-potency M by adding log R (Equation (10)). 


Agreement of the working probits y with the fitted parallel straight 
lines, h’ in number, may be tested by the 


assay x” = [wy’] — B’ (36) 


where [wy?] = >> (wy) — >>? (wy)/D w for each line. As a convenient 
empirical rule, the assay x’ is assumed to have n = k — h’ — 1 degrees 
of freedom, where k is the number of groups with both an expected 
response and an expected non-response of at least one-half individual. 


If the assay x’ is neither significant nor approaches significance, we 
compute 


C = B’/(B’ (37) 


with ? = 3.841 for n = © and P = .05. The assay x” may reveal 


heterogeneity among the test groups although no consistent curvature 
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where = i’? >> [wx”] if the log-doses have been coded. 
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or non-parallelism invalidates the assay. C is then computed by 


Equation (14), with s* = (assay x’)/n, and # for n degrees of freedom. 
In either case the confidence interval is computed by Equation (9) with 


= + [ua] (38) 


These equations are readily extended to other probit analyses. For 
a selected level of response Y, such as Y = 5 for 50 percent, the cor- 
responding log-dose Xo may be computed from a single log-dose effect 
curve as X59 = = + (Y — g)/b = & + X’. The slope factor C is then 


determined from the data for one curve, leading to the confidence 
interval 


Xx = CX’ + V(C — 1)(CX” + w) (39) 


In another application, the chemotherapeutic index (C.I.) of a 
drug may be defined as the ratio of its LD5, the dose causing 5 percent 
damage or death, to its KD95, the dose producing a characteristic 
therapeutic response in 95 percent of individuals of the same species 
under similar conditions. This is a difference in response of D = 3.290 
probits. If the respective dosage-effect curves have the same slope b 
within the sampling error, we may compute the logarithm of the chemo- 
therapeutic index as 


log (C.1.) = — — (G2 — + D)/b (40) 


where the subscripts , and , refer to the toxic and therapeutic effects 
respectively. .The confidence interval for log (C.I.) is the same as that 
for an all-or-none assay except that M’ in Equation (9) is replaced by 
log (C.I.). 

The example in Table 7 has been recalculated in probits with coded 
log-doses x’ (Table 8). Each percentage dead in the intermediate zone 
was first converted by an appropriate table to its empirical probit y’. 
Two parallel lines were then fitted without weighting to the 5 paired 
values of x’ and y’ for each toxicant, with the common slope b = 0.5605 
and the intercepts 7g and 7% at x’ = 0. Temporary expected probits 
of 6.44 for the Standard and 6.77 for the Unknown were computed 
from these equations at z’ = 3, leading to the provisional working 
probits y’ = 6.97 and 7.23 respectively, so that each z’ was paired with 
an explicit y’. 

Parallel, unweighted lines were recomputed from the six pairs of 
z’, y’ for each toxicant in place of the customary graphic estimates, 
and then solved for each z’ to obtain the expected probit Y for each 
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TABLE 8 
ANALYSIS IN PROBITS OF THE ALL-OR-NONE AssAYy IN TABLE 7 


Iem- | Ex- Second 
Toxi- | Coded} Dead,} pirical] pected} Weight, | Working Products weighted 
cant | dose probit,| probit, probit, estimate 
% y’ w y 
wa’ wy w y 
Nico- | —2 | 8.3 | 3.61 | 3.55 6.8 3.62 |—13.6 24.616] 6.8 3.62 
tine —1 | 20.8 | 4.19 | 4.18 11.9 4.19 |—11.9 49.861]11.8 4.19 
S O | 37.5 | 4.68 | 4.81 15.1 4.68 0 70.668)/15.0 4.68 
1 | 62.5 | 5.32 | 5.44] 14.2 5.32 14.2 75.544/14.4 5.32 
2 | 83.3 | 5.97 | 6.07 | 10.0 5.96 20.0 59.600)10.3 5.96 
3 | 100 | 6.97*| 6.70 5.0 7.17 15.0 35.850) 5.3 7.13 
Total 3 30.74 63.0 23.7 316.139)63.6 
Lin- —2 | 16.7 | 4.03 | 3.87 9.5 4.05 |—19.0 38.475) 9.5 4.05 
dane —1 | 25.0 | 4.33 | 4.50 | 13.9 4.33 |—13.9 60. 187}14.0 4.33 
U O | 62.5 | 5.32 | 5.13 | 15.2 5.32 0 80.864/15.2 5.32 
1 | 79.2 | 5.81 5.76 | 12.3 | 5.81 12.3 71.463/12.4 5.81 
2 | 83.3 | 5.97 | 6.40 7.2 5.82 14.4 41.904) 7.5 5.85 
3 |100 | 7.23*| 7.03 | 3.0 | 7.45 9.0 22.350] 3.1 7.42 
— 
Total | 3 32.69 | 61.1 2.8 315.243/61.7 
| 


Unweighted, omitting x’ = 3: }o2’y’ = 11.21, b = .5605, gs’ = 4.754, gy’ = 5.092 
at z’ = 3, Ys = 6.44 and Yy = 6.77, giving provisional estimates* 

Unweighted, including x’ = = 3, [z’2] = 17.5, = 35, = 22.095 
b = .6313, Ys = 4.808 + .631z’, Yy = 5.133 + .6312’ 


First weighted estimate Second weighted estimate 
Statistic 
S U Both S U Both 
.37619 .04583 33036 40094 .05997 .34097 
7] 5.01808 | 5.15946 .14138 | 5.03212 | 5.16823 13611 
[wr'?] {129.384 {119.872 (249.256 |132.076 {122.078 (254.154 
[wzx'y] 84.273 70.738 {155.011 85.758 72.108 {157.866 
b .62189 .62114 
B 54.890 41.743 96.401 55.683 42.592 98.057 
[wy?] 56.498 45.735 57.175 46.378 
x? 1.608 3.992 |x? = .232/ 1.492 3.786 |x? = .218 
Assay x? 5.832, n = 9 5.496,n =9 
M',M .08393, 1.6859 .08430, 1.6863 
C, vea/ve5 1.04150, .18202 1.04077, .18381 
08865 -08829 
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treatment. Given the Y’s and tables of the weighting coefficients 
(Z°, PQ), the minimum probit (Y — P?/Z) and range (1/7), the weights 
wand the working probits y followed readily. Two columns of products 
completed the initial part of the computation. 

The calculation of the first weighted estimate is shown in the lower 
part of Table 8, leading to .W’ = 0.08393 and L = 0.1773 by Equations 
(33)-(38). With assay x*° = 5.832 for 9 degrees of freedom, the assay 
proved satisfactorily homogeneous. The approximation has been 
carried a stage further by computing improved Y’s from these weighted 
curves, leading to new weights w and working probits y for a second 
weighted estimate. Nine of the 12 y’s in this second approximation 
were unchanged and the remaining three differed so little, that the 
second weighted 1/’ = .08430 agreed within one-half percent with the 
first weighted estimate. The assay x’ dropped to 5.496 and the con- 
fidence interval L to 0.1766, all representing so little change as not to 
warrant a third approximation. By starting with an unweighted, 
computed estimate, the first weighted calculation can often be con- 
sidered a very fair maximum likelihood estimate, even though the 
weights may vary as much as fivefold among themselves. 

The computed probit estimates were approximated more closely by 
the angular calculation when the latter was restricted to the interme- 
diate kills. This is consistent with limiting the calculation in angles 
to the zone where they most nearly parallel the probits. When the 
doses killing 100 percent were included, and the computation was con- 
tinued to a maximum likelihood estimate parallel:ng that for probits, 
successive values of J/’ were 0.07627, 0.07260 and 0.07188. Since the 
angular transform is justified theoreticaily by its approach to the probit, 
a complete maximum likelihood solution which includes 0 and 100 
percent kills has little to recommend it. 


THE CONFIDENCE INTERVAL FOR THE RATIC OF TWO MEANS 


As noted before, when the standard deviation of the threshold dose 
is a small enough fraction of its mean, a normal and a log-normal 
distribution cannot be distinguished empirically. This is sufficiently 
true of the pigeon assay for digitalis that its potency is computed in 
U.S.P. XV directly from the ratio of the mean threshold doses for the 
Standard ds and for the Unknown Zé, as 


(41) 


where v is the threshold dose killing an individual pigeon and R is the 
ratio of the number of ml. of the Standard preparation to the number of 
ml. of the Unknown preparation, each in 100 ml. of their respective 


é 

t 

r 

I 

t 

‘ ir 

al 

It 

| is 

en 

th 

at 

| co 

de 

| in 

; 


CONFIDENCE LIMITS 521 


dilutions. Since each pigeon is assigned to a preparation at random and 
titrated separately, the numerator and denominator of P, are inde- 
pendent and hence have zero covariance. The pooled variance of v, 
s’, is computed by Equation (2), substituting v for x. The error variance 
of dy in the denominator of P, is s°/Nv = v,,. Substituting in Equa- 
tion (8), we have C = i¢/(éu — s°t?/Ny), leading to the confidence 
interval 


L = 2V(C — 1)(CPy + R’fu/fs) (42) 
from which Xp. = CP, + 3L. 


SLOPE-RATIO ASSAYS 


In some assays, the response y can be plotted as a straight line 
against arithmetic dosage units. The linear relation is usually restricted 
to the smaller dosage levels, and depending upon the nature of the 
response, the slope may be either positive or negative. When the 
plotted lines for the Standard (subscript 1) and for each of the m — 1 
Unknowns (subscripts 2 to m) meet at zero dose within the sampling 
error, the relative potency is given by the ratio of their slopes. Al- 
though this is not essential, the analysis is considerably shortened 
when the k doses of each preparation are spaced at equal arithmetic 
intervals and coded, such as to x = 0, 1, 2, 3, --- , k (Barraclough 
[1955]; Bliss [1946], [1952b]; Finney [1952b]; Wood [1953]). The slopes 
are computed with a common intercept a’ at x = 0, so that 


Y, = a’ + bx, , Yo = a’ + bot, Y, =@’ + bez. (43) 


In coded units, the relative potency P{ , for Unknown 2, for example, 
is the ratio 


P! = b/b, = a/b (44) 


To recover the potency P, in original units, Py is adjusted for differ- 
ences in the coding units by multiplying by R = Is/Iy , the ratio of 
the dosage intervals. 

When the assay is fully balanced, with the same number of replicates 
at each of the k dosage levels, the inverse matrix of c,;’s required for 
computing the intercept a’ and the slopes b, , b. , --- , b, need not be 
determined explicitly. If control responses at z = 0 have been included 
in an experiment with a total of km + 1 doses and each treatment has — 


f replicate responses y which total T, , the common intercept is deter- 
mined as 


F(2@k + 1) + mkk — (45) 
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where >> is the total of all N = f(mk + 1) responses and Tj = > (xT) 
for cach preparation. In the absence of a zero dose, N = fmk and 
the equation for a’ reduces to 


fmk(k — 1) 
In either case, the slope for each preparation is computed as 
b= i +1) (46) 


Tests are available (Barraclough [1955]; Finney [1952b]) for deter- 
mining whether the dosage-response curves for all preparations are 
satisfactorily linear and have a common intercept within the sampling 
error. The variation about these lines is assumed to be independent 
of dose and of preparation, a condition which is usually met more 
easily as each P, approaches unity. If the assay meets these require- 
ments for validity and the response units have been placed and handled 
entirely at random, the error variance about the m lines can be deter- 
mined as 


(47) 
where the degrees of freedom n = N — m — 1. 

Many experimenters handle replicate tubes together throughout 
an assay, and in this case the variation among the f replicates within 
treatments may underestimate the assay error. The error variance, 
adjusted to units of a single tube, is then computed from the variation 
of the treatment totals (7',) about the fitted lines as 


8 = {LV (48) 


with n = m(k — 1) degrees of freedom. 

With the matrix of c;;’s and the error variance s’, the variances and 
covariance of the two slopes which form the numerator and denominator 
of P{ can be obtained in the general case (Bliss (1946]) as v,, = cus’, 
Vee = C28’, ARd Va, = C128’. These terms are substituted in Equation (7) 
to obtain C = bi/(bi— ¢,,8°t’) and in Equation (8) to obtain 4Z and the 
confidence limits of P{ as 


Xp, = CP, — K + V(C — + + K(K — 2CP{) (49) 


where K = (C — 1)ce,2/c,, and Student’s ¢ for the required level of 
significance depends upon the degrees of freedom in s’. 

In the balanced assay described above, with equal intervals between 
doses and a constant number (f) of replicates for each preparation, the 
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confidence interval can be computed readily. The ratio c,2/c,, = 1, 
and the terms needed for computing C and K have simple forms. With 
f values of y at x = 0, the required constants are 


6 1 3 
and 


3k(k + 1) 
(8k + 1)(k + 2) + mk(k — 1) (51) 


If there is no observation at « = 0, these reduce to 


ou = +1) \kK +1 mk 
and 
Ci2 3(k + 1) 


+ 1) + mk — 1) 


The slope-ratio design has its main application in microbial assays 
of vitamins and amino acids, and potentially in biochemical research. 
With the above equations, exact confidence limits are no more difficult 
to compute than the usual approximations. 


TABLE 9 


CaucuLaTIon or Retative Porencies P{ anp Conripence Limits X 
IN THE Assay OF Niacin CiTep By Barrac [1955] 


Preparation 
Term 

1 2 3 4 5 
Ti = |80.1 65.5 67.8 65.7 65.8 
b; (Eq. 46) 1.45714 .93571 1.01785 .94285 94642 
Ps (Eq. 44) 64216 .69853 .64706 64951 
CPL .64370 . 70052 64864 65111 
K(K — 2CPJ)| | — .00466 — .0U507 — .00470 — .00472 
(Eq. 48) .0818 .0831 .0819 .0820 
Xp,- (Eq. 48) .562,.726 .617, .784  .567,.731 .733 


= 168.4, = 344.9, k = 3,m = 5, f = 2, a’ = 3.2750 (Eq. 45) bs = 
{7/12 — a’} (Eq. 46), )7?/2 = 923.270, s? = .05242 (Eq. 48), n = 10, t? = 4.965 
(at P = .05), cu = .06493 (Eq. 50), C = 1.008024 (Eq. 7), ci2/en = .45000 (Eq. 
51), K = .003611 
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The calculation may be illustrated with the data from the balanced 
niacin assay cited by Barraclough [1955]. Each of m = 5 preparations, 
a Standard (preparation 1) and four Unknowns (preparations 2 to 5), 
was tested at k = 3 dosage levels with f = 2 replicates. The assay 
included two zero-level tubes. The totals 7’, of the 3 paired measure- 
ments for each preparation were multiplied by the coded doses, z = 1 
to 3, to obtain the T% in the first row of Table 9, leading to the values 
of b; and Py in the next two rows. The preparations proved satis- 
factorily homogeneous in individual tests for validity. In addition, the 
mean square for scatter about the five fitted lines divided by that 
between duplicates gave a non-significant F = 1.18 (n, = 10, n. = 16). 
However, since F > 1 and randomization was not reported explicitly, 
the error variance s’ = .05242 has been computed here by Equation 
(48), leading to C = 1.008024 and the confidence intervals X p,. in the 
last row of the table. 


SUMMARY 


Bioassays are of two types, those in which the threshold dose of a 
given drug is a dependent variable, measured directly in each test 
animal, and a far larger class in which dose is the independent variable 
and the size of the reaction is the measured dependent variable. In the 
few assays of the first type, the log-potency is computed from the 
difference between two means or from a mean difference; its confidence 
limits are based upon familiar assumptions and equations, which are 
reviewed briefly. In the great majority of assays, the potency or its 
logarithm is computed from the ratio of two statistics and its confidence 
limits are the roots of a quadratic equation. This equation in an 
especially simple form, which was proposed originally by Marks for 
the cross-over assay of insulin, has been generalized for all assays based 
upon a ratio, including those with a significant covariance between 
numerator and denominator. Some of its applications are illustrated 
with examples of vitamin, hormone and insecticide assays. 

The general form for limits without covariance has been adapted 
for selected parallel-line assays, many appearing in U.S.P. XV. These 
include balanced factorial assays with one Unknown; partially balanced 
factorial assays, where the Standard and Unknown differ by a missing 
end dose but are otherwise equivalent; factorial assays with unequally 
spaced log-doses, which occur frequently in microbial assays; assays 
with more than one Unknown tested concomitantly with the same 
Standard; and assays in balanced pairs, such as the twin cross-over 
assay for insulin. In each of these, the numerator and denominator 
are uncorrelated by virtue of the design. 


: 
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In several parallel-line assays, potential correlation between numer- 
ator and denominator is usually negligible and can be ignored in com- 
puting the confidence interval. Among these are assays in randomized 
sets, where the sets may differ significantly in the slope of the log-dose 
response curve; factorial assays in which missing responses are replaced 
later by calculation; and assays from the successive reactions of a single 
animal or piece of tissue, in which the responses to a constant dose of 
Standard are converted to moving averages when calculating the log- 
potency. In this last case, the approximate equations proposed for the 
U.8.P. XV posterior pituitary assay have been tested by experimental 
sampling. 

The confidence interval for all-or-none assays depends in part 
upon the transformation adopted for the percentage response, although 
covariance is not a complication. With the more convenient angular 
transformation and responses in the range from 10 to 90 percent, the 
simple equations for balanced factorial designs are applicable. The 
additive probit transformation follows directly from the underlying 
mathematical model and imposes no theoretical limitation upon the 
range of response, but unequal weighting and an iterative solution 
increase the amount of computing. If the assay is designed factorially, 
the log-doses can be coded and objective expected probits computed 
from a preliminary analysis in unweighted probits. These lead to a 
weighted maximum likelihood solution with appreciably less calculation. 

In assays based upon the threshold dose, potency can be computed 
directly from the ratio of two mean doses without the use of logarithms, 
as in the U.S.P. pigeon assay for digitalis; its confidence interval is 
that for a ratio without covariance. Logarithms are also avoided when 
the response is a linear function of the dose rather than of the log-dose, 
and potency is determined from the ratio of two slopes as a slope-ratio 
assay. The numerator and denominator of this ratio are always cor- 
related, so that covariance enters unavoidably into the calculation of 
its confidence interval. Relatively simple equations are presented for 
balanced assays with several Unknowns and the same number of 
replicates for each treatment. 
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QUERIES 


GeorGcE W. SNEDEcOR, Editor 


QUERY: In Biometrics 11, No. 3 (1955) G. A. MelIntyre analyses 

123 the results of a complex experiment in which 4 treatments are 
applied in two latin squares to 4 different test plants and leaves 

in 4 positions in the test plants. The results of treatments are then 
assayed in 4 Graeco-Latin squares each with 4 assay plants, 4 positions 
of leaves within the assay plants, and two treatments on the two halves 
of each leaf. 
Clearly the variance component of in line 15 of table 2 has been 
obtained as half the difference of the mean squares in the lines 10 and 14. 
We can, however, obtain an estimate of cf in another way. The ex- 
pected values of the mean squares in the lines 8 and 9 are identical and 
the mean squares can be pooled; and likewise for lines 12 and 13. This 


gives 
D.f. Pooled mean squaré 
Line 8 and 9 24 38.41 
Line 12 and 13 24 18.52 
Difference 19.89 


Now it will be seen that the difference between the expected values in 
lines 8, 9 and 12, 13 is again 203 , so that we obtain a second estimate 


ok = 19.89/2 = 9.95 
nd instead of the value of 2.10 of McIntyre. They differ almost by a 
factor 5; which estimate is the correct one? 


This question raises a more general problem. In Mcintyre’s Table 1 ; 
- we find 14 different mean squares. The mean squares in lines 1 and 11, 
8 and 9, and 12 and 13 have identical expected values and can therefore 


zed be pooled. Even then, however, 11 mean squares remain against only ~ 
; 8 variance components, so that these variance components are not 
| uniquely determined. 
ded Usually one finds in the textbooks that in an analysis of variance the 
17: | number of mean squares is equal to, or with confounding less than, the 
number of variance components, and I am inclined to the opinion that 
bon | this should always be so. If this opinion is correct we must conclude 
onn. 


that the analysis carried out by McIntyre is not correct and can be 

7g: | improved, but the experiment is of so complicated a nature that it is 
not easy to see what is the correct way and how to find it. If I am 
wrong, that is if the number of mean squares can be larger than the 
number of variance components, then the question to be solved is how 
these variance components should be estimated. 
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The analysis recorded in ‘Design and Analysis of Two 
ANSWER: Phase Experiments’ has given unbiased estimates of 
treatment effects and an appropriate estimate of treatment 
error. The pooled analyses of the variation within Graeco-Latin 
squares together with the analysis of the Latin squares has given 


unbiased estimates of location components adequate for the purpose — 


of illustrating the effect of modification of the design. 

However, the estimates of components are not the most efficient. 
For instance, as indicated by the querist, further information on oj 
can be obtained from the analysis of variation within Graeco-Latin 
squares. The variation between letters within an alphabet in any 
square is the variation in lesions due to virus concentrations associated 
with particular combinations of light treatment effects and leaf position 
on the test plant. This variation is potentially different for each 
alphabet in each square. Those identified as Latin originate from the 
first Latin square. The components given in Table 2 of the paper refer 
to the mean square taken over all four Graeco-Latin squares when 
orthogonality of treatment and test leaf positions obtains. 

The estimate of 20 given by the querist is the average of eight such 
estimates from the eight differences of sum and difference mean squares, 
one from each alphabet in each square. These estimates are not scaled 
x’ variates but have variances which involve the associated alphabet 
as well as «2. An estimation for the variance of the average estimate 
of 2c; , subject to the usual assumptions of normality for A and e, is 


+ orr)(ox + + 803 + + 4o'] 


This involves several poorly determined components. Starting with 
the estimates given at the foot of Table 2 one can by a least squares 
iterative process arrive at a combined estimate of 2cf from this source 


and from the difference of the error mean squares for the sum and - 


difference analysis. 

Some of the information within squares on os can be recovered 
through a recasting of the analysis of the assay phase. At the same 
time this rearrangement clarifies the relation between the analyses of 
the two phases. Because these analyses have data in common there is 
some overlap in estimators for components but they are not independent. 
This overlap seems to have been the cause of the querist’s difficulties. 

’ For two phase designs of the types described in the paper and with 
replication in the assay phase the data for the first phase analysis is 
the sum of the second phase replicates deriving from material in a first 
phase plot. An analysis of the assay phase, which would usually be 
made only to estimate second phase components, will isolate certain 
degrees of freedom, sum of squares and corresponding components 
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which in total will equal the sum of the degrees of freedom, sum of 
squares and components of the first phase analysis. The remaining 
sources of variation in the assay analysis will contain only second phase 
components. 

In the numerical example the entries into the first phase design are 
the totals of the Latin and Greek letters within squares of the second 
phase design. The degrees of freedom and sum of squares and com- 
ponents between test plants can be identified with sets and plants 
within sets in the Latin analysis and with alphabets within squares and 
squares (not given in Table 2) in the Graeco-Latin analysis. There is, 
however, no correspondence in the second phase analysis for the re- 
maining 24 degrees of freedom in first phase analysis because of the 
method followed to isolate the whole plot and sub-plot errors. By 
rearrangement a correspondence can be achieved. 

The Graeco-Latin square (in the sense of Yates, not of Cochran 
and Cox, Experimental Designs) is a partially balanced incomplete 
block design with two associate classes and with grouped blocks ortho- 
gonal to treatments in two directions. The blocks are whole plots or | 
paired sub-plots. The analysis given in Table 2 is essentially the intra- 
block analysis with treatments eliminating blocks given by the sum 
of Alphabets, Latin (differences) and Greek (differences). Effects due 
to assay plants and leaf positions within plants can be removed from 


. the block sum of squares, the residual block effect ignoring treatments 


consisting of the sum of the whole plot error, Latin (sums) and Greek 
(sums). In the auxiliary analysis the sums of squares between Latin 
letters and between Greek letters are substituted for Latin (differences) 
and Greek (differences) to give treatments ignoring blocks. In the 
parallel adjustment of the residual block effects to eliminate treatments 
the whole plot error remains unchanged, the other terms being Latin 
(sums) plus Latin (differences) less Latin and Greek (sums) plus Greek 
(differences) less Greek. The rearranged analysis with components is 
given in the accompanying table. 

With this arrangement, the totals of the degrees of freedom, sum 
of squares and components for the first four terms of this analysis are 
the same as the corresponding totals of the Latin analysis, being different 
analyses of the same entries. The remaining degrees of freedom are 
associated only with assay components. The test plant components 
are estimated from the Latin analysis and the assay components from 
the mean squares corresponding to these remaining degrees of freedom. 
The whole analysis of the 127 degrees of freedom could be formally 
presented with 31 degrees of freedom for the first phase analysis and 
the remaining 96 degrees of freedom as given in the latter part of the 
auxiliary analysis. 
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The variance component of can be estimated as 3(10.60 — 7.60) 
or 2.28 in place of 3(11.80 — 7.60) or 2.10. With some slight gain in 
precision, estimates of of and o? can be derived from the three scaled 
x’ estimators of 2 + o? ; 203 + o? , and o? with 24, 12 and 36 df. 
respectively by maximum likelihood procedure. After one cycle of 
iteration the estimates are of = 2.21, 02 = 7.63. 

In estimating components in the analysis of these data it has been 
assumed that interactions among treatments and sources of variation 
from plant material can be ignored. It should also be mentioned that 
unconscious variation in technique in applying virus to the leaves 
in the assay phase may have contributed to variation in lesion counts. 
If such variation in technique is random then the effect will be con- 
founded only with o? but non-random variation would be also con- 
founded with some or all of the other location components and perhaps 
treatments. 

The supposition of the querist that in general the number of mean 
squares with different expectations in an analysis must not be greater 
than the number of components is not true for any repeated incomplete 
block design since there are two or more estimates with different expec- 
tations involving only block and plot effects. Even with a single 
classification one can have an analogous case. 

Suppose that a random sample of strains of a species is tested for 
variation in yield, using a completely randomised arrangement. The 
strains are randomly divided into two groups, the k, members of the 
first group being replicated r, times, the k, members of the second 
group r, times. The data could be analysed on Model II as below into 
four contrasts, each involving one or both of two components of varia- 
tion. On the assumption that both the strain means and replicate 


elements of variation are normally distributed these mean squares are 
scaled x’ variates. 


Variation Degrees of freedom Expectation of mean square 
2 
riro(ky + 
Between groups 1 1 
Within group 1 k, -1 1 
Within group 2 kp 1 
Replicate error k(r, 1) + — | 1 
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The mean square between strains taken as a whole has an expecta- 
tion of 


f riky 2 2 
ky m ke | + Take Tok» + 

which is not distributed like x’. The conventional procedure would be 

to estimate o% from this in conjunction with the replicate error mean 

square. A more efficient estimate could be obtained from the four 

mean squares of the analysis by maximum likelihood procedure. 


G. A. McINTYRE 
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ABSTRACTS 


Papers presented at joint Biometric Society (ENAR), A. S. A. and I. M. S. Sessions, 
Detroit, September 7-10, 1956. 


DAVID W. ALLING (Hermann M. Biggs Memorial Hospital, 
389 Ithaca, New York). The After-History of Pulmonary Tubercu- 
losis; A Stochastic Model. 


Patients with pulmonary tuberculosis were classified at the time of 
diagnosis and at succeeding annual intervals as being in one of the 
following three clinical states: 


R_ having arrested tuberculosis 
A having active tuberculosis 
D being dead of tuberculosis 


By suitable division of these three states into six substates a discrete, 
stationary stochastic model is constructed which is found to correspond 
reasonably well with sets of observed data. 


390 IRA A. DeARMON, JR. Control of Precision in the Plate - 


Count Assay. 


In estimating the number of viable organisms in a sample concen- 
tration of bacteria, the measurement consists of the plating operation 
and the serial dilution operation. The variability of each of these 
- operations has been evaluated for a fastidious microorganism and has 
been shown to be consistent with that variability expected in a Poisson 
distribution. 

Utilizing the x’ distribution, a control procedure has been suggested 
whereby both the plating operation and the serial dilution operation 
can be assessed for excessive variability on the same control chart. 
_ This control procedure has been applied with success to an actual 
program carried out in a biological laboratory. 


391 D. B. DeLURY (Ontario Research Foundation). Elements 


of the Analysis of Covariance. 


The point of view followed throughout the paper is that the analysis 
of covariance is simply regression theory adapted to data which contain 
a considerable amount of orthogonality. Two examples are discussed, 
to bring out a little of the algebra by which standard regression theory 
is organized in the covariance pattern and some of the arithmetical 
technique that goes with it. One example involves two samples, each 
with several observations, and one concomitant variable, the other is 
based on an orthogonal experiment with two concomitant variables. 
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CHARLES W. DUNNETT (Lederle Laboratories, American 
392 Cyanamid Company, Pearl River, New York). Multiple De- 
cision Procedure. 


This is an expository paper describing the use of multiple decision 
procedures as an alternative to the analysis of variance in problems 
concerned with selecting the ‘‘best’’ of several treatment categories. 
The particular procedures described are those due to Bechhofer (Ann. 
Math. Stat., 25: 16-39) and Somerville (Biometrika, 41: 420-429). 


MINDEL C. SHEPS AND PAUL L. MUNSON (Harvard Uni- 
393 versity). The Role of Between-Assay Error in Biological 
Assays. 


Discrepancies in relative potency estimates within a biological assay 
method may be due to deviation from one or more of the assumptions 
implicit in the assay model. In a chick comb assay method for andro- 
genic potency, observed variation between replicate M’s for urine 
extracts had been three times as great as predicted. No systematic 
causes were apparent from a study of the data. 

A change in technique that simplified the problems of handling the 
material to be applied to the comb resulted in only slight changes in 
the intra-assay variability, while the residual variance and assay 
slopes were still heterogeneous. However the observed variation of 
the M’s for urine extracts now agreed very well with the intra-assay 
predictions. .W’s for steroids of varying chemical structure had an 
observed variance 1.7 times as great as predicted. 

An experiment was performed consisting of 4 assays (replications) 
comparing the effects of 4 equally spaced log doses of androsterone 
(the reference standard), a urine extract (L25) and the steroid methyl 
testosterone (MT). In a combined analysis of variance the only signifi- 
cant interaction found was for assays by differences in the slope of MT 
as compared with the other two slopes. The observed variation in M’s 
for both unknowns was not significantly different from that predicted. 

The M’s obtained with a parathyroid assay method were also not 
excessively variable despite the marked variation in residual s° and, 
in many of the assays, poorly determined slopes. 

The present data as well as numerous reports in the literature suggest 
that inter assay variation, both in biological and chemical assay methods, 
can be detected only if specifically sought for. When present, it should 
be included in estimates of error. When it is due to a cause other than 
non-parallelism it is often possible to diminish it markedly or to eradicate 
it completely. 
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394 R. LOWELL WINE (Virginia Polytechnic Institute). On the 
Comparison of Multiple Test Procedures. 


Testing the homogeneity of a set of means by the F-test may result 
in the conclusion that they are not all alike, but it fails to signify any 
arrangement of distinguishable groups among the means. The problem 
of separating a group of heterogeneous means into subgroups of means 
said to be non-heterogeneous has been approached in several different 
ways by a number of research workers. This has caused many workers 
outside the field of statistics, who in the past have used the least signifi- 
cance difference test, to wonder which of the many available tests should 
be applied in a given experiment. Using the least significant difference 
test as a starting point, some of the best multiple range tests are com- 
pared to this familiar test with respect to error rates and ease of applica- 
tion. It is pointed out that the multiple F-tests, being more difficult 
to apply, probably should not be used unless the investigator is inter- 
ested in contrasts involving more than two means. 


Papers presented at joint Biometric Society (WNAR) meetings and I.M.S. sessions, 
Seattle, Washington, August 22-24, 1956 


MOHAMED 8. AHMED (University of California, Berkeley, 
395 California). A Stochastic Model for the Tunnelling and Re- 
tunnelling of the Flour Beetle. 


The stochastic model developed to describe the tunnelling and 
retunnelling of the flour beetle is a Markov chain with only three 
states: s, , tunnelling; s, , stationary; and s,; , retunneling, and with 
continuous time parameter and_ stationary transition intensities 
q:;(t,7 = 1, 2,3). Transition from one state to any other is visualized 
with the restriction that the beetle cannot move from the tunnelling 
state to the retunnelling state, or vice-versa, except by passing through 
the stationary state. 

For this model: 1) the probability that the beetle is in state s; at 
the end of the time interval ¢ given that it was in state s,; at the beginning 
of this time interval, as well as 2) the expected time spent in a state s; 
out of the total exposure time 7, is found explicitly in terms of the 
four unknown parameters , Y23 2Nd q32. In addition, a scheme 
for the estimation of these parameters is given. 

This model can be used to study the differences in the behavior of, 
say, the male and the female beetles with regard to the proportion of 
time they spend in the various states. 


ote 
We) 
|, 
| 
| 
ae 


536 BIOMETRICS, DECEMBER 1956 


396 WALTER BECKER. An Investigation of the Log Transforma- 
tion of Growth Data. 


In order to justify transforming rat body weight data to logs, an 
investigation was made of the birth to 98 day weight data expressed 
in grams and log grams. The data in grams indicated almost complete 
proportionality of the within litter standard deviation with the un- 
weighted mean when plotted on log-log paper, also a large. significant 
interaction mean square between the two sexes and litters was found 
in the analysis of variance. Transforming to logs materially reduced 
the proportionality and succeeded in removing most of the interaction. 


P. HORST (University of Washington, Seattle, Washington). 
397 Optimal Estimates of Multiple Criteria with Restrictions on the 
Covariance Matrix of Estimated Criteria. 


In the prediction of personal adjustment a generally appropriate 
model involves a set of predictors as independent variables and a set of 
criterion or dependent variables. The conventional “least squares” 
methods for estimating the criterion variables from the predictor 
variables have long been well known. However, in problems of differ- 
ential classification it may be desirable to impose certain restrictions 
on the estimated criterion variables. One set of conditions which 
appears to be useful in this connection is that the variance-covariance 
matrix of the estimated criterion variables shall be as nearly equal as - 
possible in the “least square’’ sense to a prespecified grammian matrix. 
One such matrix is a function of specified quotas in the differential 
classification problem. It is proposed that an appropriately constrained 
“least squares” solution for the estimated criteria may provide a basis 
for differential classification which will approximate specified quotas 
and which is much simpler in application than the conventional] linear 
programming approach. The solution for the matrix of regression 
vectors with specified restrictions is derived. 


HAROLD HOTELLING (Institute of Statistics, University of 
398 North Carolina, Chapel Hill, North Carolina). New Light on the 
Multiple Correlation Coefficient. 


This paper is analogous to the author’s work on the simple correla- 
tion coefficient r (J.R.S.S., Vol. 15B, 1953, 193-232), but deals with 
problems, both mathematical and logical, of a somewhat different 
kind. Starting from Sir Ronald Fisher’s work on the distribution of 
the multiple correlation coefficient R, emendations are made regarding 


\ 
| 
| 
@ 


ABSTRACTS 537 


the geometrical derivation and the asymptotic approximation. Im- 
proved methods of calculating the A distribution of R are obtained. 
An investigation is then made of transformations of R corresponding 
to those of the simple correlation coefficient r. The problem for R is 
considerably different because R is restricted to positive values and 
has great skewness even for the case analogous to symmetry for r. 
This leads to a very different situation in such problems as discriminat- 
ing between the values of the correlation in different samples, or averag- 
ing of transformed correlations. Consideration is given to the “shrink- 
age” of R alleged by some psychologists, and to other problems arising 
in applications, including test construction and selection of predictors 
by choosing maximum multiple correlations. 


FLOYD A. JOHNSON (Pacific Northwest Forest and Range 
399 Experiment Station, U.S.A.). The Role of Statistical Methods 
in Forest Research. 


Statistical methods had a rather minor role in forest research until 
the early 1930’s. Before that time the terms statistical methods and 
calculation techniques were practically synonymous. Since that time 
the work of the English statisticians, particularly R. A. Fisher, has 
been responsible for a slow revolution in forest research procedures. 


Statistical considerations now affect all phases of the experimental © 


process; the planning phase, the field phase and the analysis phase. 
The development of inferences from experimental data is recognized 
as a statistical problem, and these inferences are no longer left un- 
qualified. Nowadays foresters in research are increasingly preoccupied 
with determining how far a sample estimate may have missed its mark 
or how much chance had to do with apparent differences among ob- 
served phenomena. 


400 LINCOLNE.MOSES. Some Useful Non-parametric Techniques 


Occasionally one may have a fully known population distribution 
F(x) and desire to test the hypothesis that he is sampling from that 
population against slippage alternatives. Such a known distribution 
will often be found tabulated in The Statistical Abstract, and not have 
a specified functional form. The test statistic proposed is 
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which may be taken as a unit. normal deviate for N > 5. The test is 
in fact Wilcoxon’s test for one sample size infinite, and so enjoys the 
same properties. 

In the experimental set up associated with Friedman’s analysis of 
variance by ranks one may have in mind an alternative hypothesis 
which makes desirable the identification of a “large treatment” (if 
any); an overall test of homogeneity as provided by Friedman’s statistic 
is not especially sensitive for this purpose. The statistic proposed is the 
maximum rank sum, which, if sufficiently large, leads to rejection of 
the null hypothesis. The asymptotic distribution of the suitably 
normalized statistic is just that of 


(X 2) 


o 


as tabulated by Pearson and Hartley. Some exact calculations indicate 
that the asymptotic values are adequate for samples of small to medium 
size as well. 


401 EARL R. RICH (University of California). A Stochastic Model 
for the Number of Beetles on the Surface of Flour. 


The problem considered is that of devising a stochastic model for 
the numbers of beetles on the surface of flour as a function of time. 
It arises in connection with analytical studies on the population dy- 
namics of the flour beetle Tribolium confusum. 

All beetles start at time ¢) on the surface of the flour, and we assume 
that for all ¢: (a) the probability that a beetle on the surface at time ¢ 
makes a transition to the interior in an interval ¢ to (¢ + A?) is \; (b) the 
probability that a beetle in the interior at time ¢ makes a transition to 
the surface in an interval ¢ to (¢ + Af) is u. 

The probability that a beetle on the surface at ¢ = 0 will be on the 
surface at ¢ may be written 


+ 


F(t, d, A+u 


Considering n beetles and assuming them to act independently of one 
another we can write that X, , the number of beetles on the surface at 
time ¢, is a binomial random variable with probability F(t, 4, wu). The 
parameters were estimated using two beetles. It was found that the 
assumption of independence is not tenable, the duration of stay in the 
states where both beetles are together being far in excess of that implied. 
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NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secretary 
(if members at large, to the General Secretary) news of appointments, 
distinctions or retirements and announcements of professional interest. 


DR. J. W. TREVAN, F.RS. 


Dr. J. W. Trevan, formerly Director of the Wellcome Research 
Laboratories and temporary Director of research in the Wellcome 
Foundation Ltd., died on 13 October, 1956, aged sixty-nine. 

Dr. Trevan’s work in quantitative pharmacology at the Wellcome 
Laboratories in the 1920’s directed his attention to basic experimental 
and statistical considerations affecting bioassay. In this field he was 
an eminent pioneer and originator, and became particularly distin- 
guished for his contributions to the biological standardization of insulin. 
During his long career he received many scientific honours, including 
election as a Fellow of the Royal Society of London in 1946. He was . 
a charter member of the Biometric Society, and the Society in general 
and its British Region in particular owed much to his continuing 
kindly interest. 


Dr. W. Edwards Deming was awarded the Shewart Medal for 1955 
by the American Society for Quality Control in recognition of his 
many contributions in this field. 


Dr. David B. Duncan has joined the staff of the Institute of Statistics, 
University of North Carolina, at Chapel Hill to participate in the 
teaching and research program of the Department of Biostatistics of 
the School of Public Health and the Statistics Department. He will 
have responsibility for the graduate programs in Biostatistics. 


Dr. Arnold H. E. Grandage has been appointed Associate Professor 
in the Department of Experimental Statistics, North Carolina State 
College. He will be in charge of the electronic computing laboratory, 
and will do teaching and research in industrial statistics. 


Dr. Eugene Lukacs, formerly with the U. 8. Office of Naval Research, 
is now professor at the Catholic University of America, Washington, 
D.C. 


Dr. Frank J. Massey, Jr., is now Associate Professor of Biostatistics, 
School of Public Health, University of California at Los Angeles. 


Mr. John W. Mayne is now Director of Operational Research, 
Royal Canadian Navy. 


Dr. G. B. Oakland, after two years at Marischal College, University 
of Aberdeen, has returned to the Statistical Research and Service Unit, 
Science Service, Department of Agriculture, Ottawa, Canada. 
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Dr. Bernard Ostle, Professor of Mathematics, has been appointed 
Director of the recently formed Statistical Laboratory, Montana 
State College, Bozeman, Montana. 


Effective December 1, Professor J. A. Rigney will spend two years 
in Peru administering an Agricultural Research Program for which 
North Carolina State College has responsibility. 


Dr. Vincent Schultz is now Associate Professor of Biostatistics, 
School of Public Health, University of California, Los Angeles, Cali- 
fornia, U.S.A. 


Professor Hugh Fairfield Smith of the Institute of Statistics, North 
Carolina State College has been requested by the Statistical branch 
of the FAO of the United Nations to serve as agricultural statistician 
to the Philippines Government for one year, beginning October 1, 1956. 


It was wrongly announced in the September issue of this journal 
that Professor G. W. Snedecor was the recipient of an honorary Doctorate 
of Laws from the University of North Carolina. The degree actually 
conferred in recognition of his many outstanding services to statistics 
was an honorary Doctorate of Science, by North Carolina State College. 


INTERNATIONAL STATISTICAL INSTITUTE 


The Fifth Term of the International Statistical Education Centre, 
Beirut, opened on 25 October 1956 and will end on 10 May, 1957. 
For the Tenth term of ISEC, Calcutta, 16 July 1956 to April 1957, 
thirty students have been selected from Burma, Ceylon, India, Japan, 
Pakistan, the Phillippines, Singapore and Thailand. 


Mr. Ragnar Thorn has been appointed by the Organizing committee 
as Secretary General for the 30th I.S.I. Session which will be held in 
Stockholm, 8-15 August, 1957. 


INTERNATIONAL TRAVEL 


The National Science Foundation will award individual grants to 
defray partial travel expenses for a limited number of American scien- 
tists participating in the 30th Session of the International Statistical 
Institute or the Congress of the International Union for the Scientific 
Study of Population, scheduled to meet in Stockholm, Sweden, August 
8 to 15, 1957. Application blanks may be obtained from the National 
Science Foundation, Washington 25, D.C. Completed application 
forms must be submitted by March 1, 1957. 
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PosTGRADUATE CouRSE 


The University of Aberdeen has established a postgraduate Diploma 
in Statistics, teaching for which will begin in October, 1957. Regula- 
tions are obtainable from the University. As the number that can be 
taken initially is severely limited, prospective students resident outside 
the British Isles in particular are urged to apply for admission not 
later than January 1957. The University Reader in Statistics is D. J. 
Finney, Sc.D., F.R.S., F.R.S.E., Marischal College, Aberdeen, Scotland. 


CoNTRIBUTED Papers ENAR ANNUAL MEETING 


Members wishing to contribute papers to the annual meeting of 
ENAR to be held in Atlantic City, N. J., 10-13 September 1957, should 
send abstracts by April 1, 1957 to 

Dr. Vincent Schultz, Program Chairman, 
Annual Meeting, Biometric Society ENAR, 
Agricultural Experiment Station, 

College Park, Maryland, U.S.A. 
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Screntiric SEssIoNns 

Brazilian Region 

At a meeting on 6 July 1956, held at Ouro Preto jointly with the 
Brazilian Association for the Advancement of Science, the following 
papers were presented: C. G. Fraga, Jr., The analysis of non-orthogonal 
éxperiments; J. S. Daniel, I. R. K. Abramof and T. Silva, The numbers 
of stomata in Coffea under different experimental conditions; A. Grosz- 
mann, Analysis of maize plant growth; S. B. Henriques and F. R. 
Mandelbaum, Bioassay for the manometric determination of glutathion; 
F. P. Gomes, Factorial experiments in balanced incomplete blocks. 


Japan 


The Fourth Meeting (Third Spring Meeting) of the Biometric 
Society, Chapter of Japan, was held at Tokyo University on 12 April 
1956. Papers presented included: Statistical Analysis in Classification 
of Foxtail Millet (Sefaria italica) in Japan (S. Kitano); Comparison 
between the Efficiencies of the J.P. VI and U.S.P. XV Tests for Pyrogen 
by the Monte Carlo Method (S. Kadokawa and 8. Shintani); Studies 
on the Discrimination Method between the Chronic Hepatic and 
Non-hepatic Diseases with Application of the Linear Discriminant 
Function (K. Takahashi and E. Miyoshi); Precision of Variety Tests 
in Japan (T. Okuno); An Elementary Method of Construction of 
Punched Cards for P"- and Other Designs (M. Masuyama). 


OFFICERS 
General 


_ Beginning 1 January 1957, Dr. Allyn W. Kimball replaces Dr. C. I. 
Bliss as Treasurer of the Society. Dr. Kimball’s address is P. O. Box 
10088, Knoxville 19, Tennessee, U.S.A. 


WNAR 


At the Annual Meeting held in Seattle on 23 August 1956, the 
following were elected:—Regional President—D. G. Chapman; Regional 
Secretary-Treasurer—Elizabeth Vaughan; Executive Committee Mem- 
bers—W. J. Dixon, C. R. Li and Elizabeth Scott. 
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Regional and National Secretaries, and their current addresses are 
listed below. 


Regional Secretaries 


Australasian: Dr. G. D. Watson, Department of Statistics, The 
Australian National University, G.P.O. Box 4, Canberra, A.C.T. 
Australia. 

Belgian: Dr. L. Martin, 165 Ave. du Domaine, Forest, Bruxelles, 
Belgium. 

Brazilian: Dr. P. M. Freire, Instituto Biologico, Caixo Postal 7119, 
Sao Paulo, Brazil. 

British: Mr. E. C, Fieller, Ministry of Supply, Stat. Advisory Unit, 
Room 426, Shell Mex House, London, W.C.2, England. 

ENAR: Dr. A. M. Dutton, Box 287, Station 3, Rochester 20, 
N.Y., U.S.A. 

French: M. D. Schwartz, S.E.1.T.A., 2, Ave. d’Orsay, Paris 7°, 
France. 

German: Prof. Dr. Wilhelm Ludwig, Zoologisches Institut, Univ. of 
Heidelberg, Sofienstr. 6, Heidelberg, Germany. 

Italian: Dr. L. L. Cavalli-Sforza, Istituto Sieroterapico Milanese, 
via Darwin 20, Milano, Italy. 

WNAR: Miss Elizabeth Vaughan, 2325 Seventh Street, Bremerton, 
Washington, U.S.A. 


National Secretaries 


Danish: Mr. N. F. Gjeddebaek, Skjoldagervej 25, Gentofte, Denmark. 

Indian: Dr. K. Kishen, Dept. of Agriculture, Uttar Pradesh, Luck- 
now (U.P.), India. 

Japanese: Mr. M. Hatamura, National Inst. of Agr. Research, 
Nishigahara, Kita-ku, Tokyo, Japan. 

Netherlands: Mr. E. van der Laan, Kornoeljestraat 60, ’s-Graven- 
hage, Netherlands. ‘ 

Swedish: Dr. H. A. O. Wold, Jarnbrogatan 18, Uppsala, Sweden. 

Swiss: Dr. H. L. LeRoy, Riedthofstrasse 34, Regensdorf, ZH. 
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INTERNATIONAL SyMpostuM ON BromerricaL GENETICS, AND 
Fourtu INTERNATIONAL BromMETRIC CONFERENCE, 
Orrawa, Avaust 28-31, 1958 


Plans for this Conference, and for the Symposium, sponsored by the Inter- 
national Union of Biological Sciences, are now in being and will be published in 
1957. Meanwhile, information on the tentative scientific program may be obtained 
from the secretary of the Society, Mr. M. J. R. Healy. Queries on other pertinent 
matters should be addressed to the chairman of the Local Arrangements Committee, 
named below. 


Symposium INTERNATIONALE SUR LA BIOMETRIE GENETIQUE, ET 
QUATRIEME CONFERENCE INTERNATIONALE DE BIOMETRIE, 
Orrawa, 28-31 Aout, 1958 


Le programme de cette conférence, et du symposium (proposé par I’Union 
Internationale des Sciences Biologiques), est maintenant a l'étude et on le fera 
connaitre en 1957. On peut cependant obtenir dés maintenant des renseignements 
au sujet du programme scientifique prévu en s’addressant 4 M. M. J. R. Healy, 
Secrétaire de la société. Toutes les demandes concernant d’autres questions rela- 
tives & la Conference devront étre envoyées au Président du Comité d’Organisation 
locale (voir dessous). 


Symposium UBER BIOMETRISCHE GENETIKS, UND 
VieERTE INTERNATIONALE BIOMETRISCHE KONFERENZ, 
Orrawa, Aucust 28-31, 1958 


Plaine fiir diese Konferenz (und fiir das Symposium der Férderung der Inter- 
nationalen Vereinigung der biologischen Wissenschaften) sind in Verbereitung und 
werden im Laufe des Jahres 1957 veréffentlicht werden. Auskunft tiber das wissen- 
schaftliche Programm kann inzwischen vom Sekretar der Gesellschaft, Mr. M. J. R. 
Healy, erhalten werden. Anfragen von allgemeinem Interesse richte man an den 
Vorsitzenden des “Local Arrangements Committee’’, (siehe unten). 


Smtrosio pt GENETICA BIOMETRICA E 
Quarta CONFERENZA INTERNATIONALE DI BIOMETRIA, 
Ortrawa, 28-31 Agosto, 1958 


Questa conferenza ed il Simposio, ch’é sotto gli auspici dell’ Unione Internationale 
di Scienze Biologiche, sono adesso organizzati ed il programma sara pubblicato nell’ 
anno 1957. Frattanto, informazioni sul programma scientifico provvisorio possono 
essere ottenute dal segretario di questa Societa, Signor M. J. R. Healy. Questioni 
su altri dettagli dovrebbro essere indirizzate al Presidente del Comitato Locale— 
vedi sotto. 


Dr. G. B. Oakland 
Chairman, Local Arrangements Committee: Science Service Building 
Department of Agriculture 
Carling Avenue, 
Président du Comité d’Organisation locale: Ottawa 3, Ontario, 
Canada 
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of moments, method of steepest 
descent, Newton’s method, recur- 
rence formulas 
accuracy of, 79 
approximate, see inefficient estimation 
automatic, 110 
of adjusted means, 365 
of summary tables, 355 
punched card, 229, 451 
simplified, 311, 521 
Confidence limits, 40, 220, and see tests 
estimation of simultaneous, 51, 70 
for bioassay, 491 
for covariance, 413 
for ratio, 493, 506 
for variance ratio, 51 
Confounding, 2, 233, 247, 260, 348, and 
see analysis of variance, compari- 
sons, design of experiments, factorial 
experiments 
partial, 11, 352 
Constraints, 345, 362 
Contingency tables, see chi square, corre- 
lation 
missing values, 47 
Contrasts, see comparisons 
Control chart, 472 
Correlation, see attenuation, causation, 
contingency, covariance, path coef- 
ficients, regression 
analysis of, see path coefficients 
autocorrelation, 455 
between relatives, see statistical ge- 
netics 
canonical, 478 
intraclass, 84 
partial, 226 
Covariance, see correlation, matrices, 
regression, variance 
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adjustment, 23, 365 
analysis of, 84, 362, 386, 447, 452, 481 
conditional, 273 
matrix, 81 
of estimates, 16, 505 
of frequencies, 273 
Critical region, 31 
Crossing over, see linkage 
Crossover design, 504 
Cumulants, see moments 
Curve fitting, see estimation, orthogonal 
polynomials, regression, smoothing 
Cycles, 335 
Defining contrast, 3 
Degrees of freedom, 49, and see com- 
parisons 
effective, 44, 102 
Design of experiments, 1, 31, 68, 226, 
233, 330, 345, 364, 383, 449, 479, 
and see aliases, central composite 
designs, components of variance, 
confounding, estimation designs, 
factorial experiments, fractional rep- 
lication, guard rows, incomplete 
blocks, lattices, linked blocks, long- 
term experiments, nonfactorial ex- 
periments, organolepsis, paired com- 
parisons, perennials, plaid squares, 
plant spacing, plot size and shape, 
polytopes, precision, quasi-latin 
squares, randomization, randomized 
blocks, replication, sampling, se- 
quential experiments, strip plots, 
tree crops, Youden squares 
for locating optimum, 20 
genetic, 218 
judging, 401 
organoleptic, 401 
Difference equations, 158, and see dif- 
ferential equations 
Differences of zero, 271 
Differential equations, 58, 157, and see 
difference equations 
partial, 59 
Digitalis, 492 
Discriminant function, 226, 232, 478, and 
see multivariate analysis, scores 
Disproportionate subclasses, 107,. 338, 
and see analysis of variance, fitting 
constants, missing values 
Distribution, see binomial, chi square, 
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moments, Poisson, runs, hyperbolic 
secant 
approximate, 101 
asymptotic, 59 
exact, 213 
fitting, see estimation 
frequency, 58 
joint, 58 
noncentral, 31 
of chi square, 100 
of classes, 211 
of. ratio, 494 
Dominance, 415 
Dose response curve, 43, 72, 227, 233, 
495, 574, and see time response 
curve 
multi-hit, 123 
Dual, 369 
Dummy variates, 27, 362 
Ecology, 154, 163, 453, and see entomol- 
ogy, fish, genetics, populations 
catchability, 183 
ED 50,311 
tables, 317 
Effective temperature, 475 
Efficiency, 81, 222, 231, and see extima- 
tion, maximum likelihood, suffi- 
cient statistics 
Eigen values, 62 
Endocrinology, 261 
Entomology, 515, and see toxicology 
Errata, 210, 221, 339 
Error, 
experimental, 17 
heterogeneity, 82, 391 
mean square, see variance 
propagation of, 123 
regression, see regression 
sources of, 335 
term, proper, 354, 387, 481 
theory of, see analysis of variance, 
least squares, maximum likelihood, 
models, rejection of data 
Type I, 31 
Type II, 450 
Estimation, 162, 211, 419, 449, and see 
bias, combination of estimates, co- 
variance adjustment, efficiency, fit- 
ting constants, information, least 
squares, maximum likelihood, meth- 
od of moments, minimum chi 
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square, Neyman-Pearson theory, 
populations, scores, sufficient sta- 
tistics, theory of errors 
designs, 3 
graphical, 123 
grouped data, 269 
inefficient, 323 
of confidence intervals, 51 
of variance, 79 
simultaneous, 51, 81, 156 
unbiased, 433 
Evolution, 209 
Expectation, mathematical, 213, 265, 
408, 417, and see mean, moments 
of mean square, 28 
Experimental design, see design of ex- 
periments 
Extrapolation, 334 
Factorial experiments, 15, 82, 259, 381, 
449, and see bioassay, fractional 
replication, nonfactorial experiments 
analysis of, 51, 111, 245, 345 
Fatigue, 128, and see organolepsis 
Faulty design, 345 
Fiducial limits, see confidence limits 
Field experiments, 82, 226, 228, 229, 449, 
and see agronomy, design of experi- 
ments, long-term experiments, plot 
size and shape, soil 
Fish, 449, and see ecology 
Fitting constants, 362, and see dispro- 
portionate subclasses, least squares, 
missing values 
Fitting distribution, see estimation 
Fitting regression line, see covariance, 
least squares, regression 
both variables subject to error, 445 
Food technology, 499, and see nutrition 
Forestry, 449 
Fourfold table, see chi square, con- 
tingency 
Frequencies, trends in, 58 
F test, 23, 52, and see analysis of variance 
biased, 224 
errors heterogeneous, 82 
interpretation of, see tests 
multiple, see multiple range test 
Future action, 68, 229 
Gamma function, see Stirling’s approxi- 
mation, trigamma function 
Gegenbauer polynomials, 62, and see 
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orthogonal polynomials 
Genetic correlation, 84, 199, 404, and see 
genetic covariance, heritability, path 
coefficients, repeatability 
Genetic covariance, 404, and see genetic 
correlation, genetic variance 
Genetic drift, random, 57 
Genetics, see allelism, animal breeding, 
chromosomes, dominance, evolu- 
tion, genetic, heritability, heterosis, 
inbreeding, linkage, natural selec- 
tion, number of loci, paternity test, 
path coefficients, repeatability 
human, 226 
population, 190, 404, 452 
statistical, 190 
Genetic variance, 404, 415 
components, 433 
Graphics, 167, 451, and see computation 
Grouping, 85, 269 
Growth, 336, 455, and see biometry, 
logistic curve 
Guard rows, 350 
Heredity, see genetics 
Heritability, 453, and see genetic correla- 
tion, repeatability 
confidence limits for, 99 
estimation of, 84 
Heterosis, 430 
Horticulture, 334, 381, and see agron- 
omy, perennials, tree crops 
Hyperbolic secant, squared, 399 
Hypergeometric, 61 
Hypothesis, see models, tests 
non-null, 31 
null, 26 
Identification, 25, 230 
Inbreeding, 201, 434 
Incomplete blocks, 301, and see lattices, 
linked blocks, randomized blocks, 
Youden squares 
partially balanced, 369, 452 
Incomplete experiments, see missing 
values 
Index of dispersion, 264 
for truncated Poisson, 275 
Industrial research, 1, 127 
Infection rate, 154 
Inference, 32, and see Neyman-Pearson 
theory, tests 
fiducial, 32 
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Information, 25, 35, 74, 253, 399, 513, 
and see analysis of variance, com- 
bination of data, design of experi- 
ments, estimation, maximum likeli- 
hood, precision 

interblock, 352, 370 
intrablock, 352 
Interaction, 1, 481, and see analysis of 
variance, models 
assumed zero, 2 
confounded, 357 
genetic, 415 
Iteration, 49, 79, 302, 514, and see com- 
putation, least squares, maximum 
likelihood 
Jacobian, 64 
Jacobi polynomials, 63, and see orthog- 
onal polynomials 
Judging, 128, and see organolepsis, selec- 
tion of judges 
K statistics, 433, and see moments 
Latin squares, 226, 233, 333, 365, 383, 
and see quasi-latin squares 
Lattices, 
simple, 405 
square, 370 

LD 50, see ED 50 

Least significant difference, 372 
test, 230 

Least squares, 174, 224, 399, 422, and see 
accuracy, adjusted means, analysis 
of variance, estimation, fitting con- 
stants, matrices, normal equations, 
precision, theory of error 

Likelihood, 31, 164, 212, and see maxi- 
mum likelihood 

Limnology, see fish 

Linkage, 208, 413, 415, 442 
correlation with genetic interaction, 
421 

Linked blocks, 369 

triangular, 369 

Loci, number of, 216, 433, and see 

genetics 


Logistic curve, 399, 455, and see growth, © 


logit transformation, populations 

Long-term experiments, 330, 345, and see 
tree crops 

Main effect, see analysis of variance, 
interaction 

Markov, 57 
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Matching, see paired comparisons 
Mathematical biology, see biometry 
Matrices, 232, and see covariance matrix, 
matrix: algebra, multivariate anal- 
ysis 
inverse, 346, 363 
Kendall, 400 
Matrix algebra, 205, 256, and see ma- 
trices 


~ Maximum likelihood, 25, 48, 163, 2il, 


301, 513, and see efficiency, estima- 
tion, information, theory of error 
biased, 222 
estimate of variance, 169, 215 
graphical, 163 
iterative computation, 79, 165 
yields no estimate, 175, 213 
Mean, see moments 
adjusted, 251, 365, 370, and see co- 
variance, least squares, regression 
ratio of, 520 
Mean deviation, 231 
Medicine, 229, 283, and see blood groups, 
clinical, genetics, pharmacology, 
physiology, toxicology 


‘Metameter, 492, and see scales 


Metric, see scales 

Microbiological assay, 501, and see bac- 
teriology, bioassay 

Minimum chi square estimate, 174 


Missing values, 23, 121, 224, 241, 338, 


508, and see fitting constants, rejec- 
tion of data 

computation of, 13, 241 

contingency tables, 47 

effect on tests, 224 

variance of, 14 

Mixed-up values, see missing values 
contingency tables, 47 


‘Models, see biometry, causation, com- 


ponents of variance, hypothesis, 
path coefficients, regression, theory 
of error, transformations 

genetic, 104, 410 

mathematical, 13, 24, 45, 73, 99, 227, 
229, 234, 339, 386 

statistical, 434 

stochastic, 155, 451 

Moments, 58, 265, 275, and see k 

statistics 

generating function, 101, 266 
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method of, 101 
Morbidity, 154 
Mortality, 227 
Moving averages, 509, and see smoothing 
Multinomial distribution, 165, 211, 265 
Multivariate analysis, 67, 190, and see 
analysis of variance, discriminant 
function, matrices 
Mutation, lethal, 216 
Newman-Keuls test, see multiple range 
test 
Newton’s method, 80 
Neyman-Pearson theory, 31, and see 
estimation, inference 
Nomography, 477 
Nonfactorial experiments, 481 
Non-orthogonal design, 346, 367 
Normal equations, 193, 362, and see least 
squares, matrices 
Normality, 398 
Nutrition, 523, and see food technology 
Obituaries, 340 
Organolepsis, 127, and see comfort, fa- 
tigue, judging, scores, selection of 
judges 
flavor, 234, 381 
quality control, 234 
randomization in, 128 
serial, 234 
simultaneous, 234 
Orthogonality, 9, 262, and see com- 
parisons 
Orthogonal polynomials, 362, “498, and 
see Gegenbauer, Jacobi 
Pairing, see paired comparisons 
Palatability, see organolepsis 
Parasitology, 154 
Pascal problem, 227 
Paternity tests, 226, and see blood 
groups, genetics 
Path coefficients, 190, and see causation 
Percentages, see proportions 
Percentiles, 228 
Perception, sce organolepsis 
Perennials, 330, and see tree crops 
Pharmacology, see bioassay, toxicology 
Physical science, see industrial research 
Physiology, 89, 233, 475, and see endo- 
crinology, medicine, threshold 
Plaid squares, 348 
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Plant spacing, 82, 350, and see plot size 
and shape 
Plot size and shape, 82, 226, 228, 333, 
and see guard rows, plant spacing, 
split plots, strip plots 
Poisson distribution, 213, 264, 280 
compound, 264 
truncated, 265 
Polya-Eggenburger distribution, 451 
Polytope, 20 
Populations, see census, distributions, 
fish, statistical genetics 
changes, 182, 230 
estimates, 163, 174, 463 
number of classes, 211, 454 
structure, 455 
Poultry, 107 
Power, 31, 34, and see tests 
Precision, 494, and see accuracy, infor- 
mation, least squares 
Prediction, 477, and see regression 
accuracy of, 478 
Preferences, see organolepsis 
Probability, 31, 211, and see runs 
Proportions, 83, 513, and see binomial 
Pseudo-factors, 246, 259 
Quality, see organolepsis 
Quality control, 465 
Quantification, see scales 
Quasifactors, see pseudo-factors 
Quasi-latin square, 245 
Radiology, 89 
Random dice, 450 
Randomer, 450 
Randomization, 128, 335, 450, 480, and 
see selection 
Randomized blocks, 348, and see incom- 
plete blocks 
Random mating, 206 
Range, see tests using range 
significant studentized 
Ranks, 128, 301, 399, and see scores, 
transformations 
Recurrence formulas, 203 
Regression, see adjusted means, canon- 
ical analysis, correlation, covariance, 
fitting regression line, orthogonal 
polynomials, trend 
additional variable, 232 
adjustment, 23 
analysis, 174, and see analysis of co- 
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asymptotic, 323 
deviations from, 486 
error, 363 
independent variable affected by treat- 
ments, 452 
model, 226, 230, and see structure 
multiple, 192, 362 
standardized, 193 
weighted, 174 
Rejection of data, 84, 499, and see miss- 
ing values, selection 
Repeatability, 84, and see genetic corre- 
lation, heritability 
Replication, 
fractional, 1, 259, and see aliases in 
mixed series, 1 
in time, 229 
Research in statistical methods, 449 
Residuals, see errors 
Respiration, 89 
Response, see bioassay, dose response 
curve, models, time response curve 
critical, 75 
graded, 72 
quantal, 72, 512 
Result-guided procedures, see tests of 
significance of results suggested by 
data 
Runs, 227 
Sample size needed, 143, 179, 213, 231, 
287 
Samples, small, 264 
Sample surveys, 449, 462, and see 
sampling 
error control, 462 
interviewer training, 464 
Sampling, 143, 449, and see components 
of variance, design of experiments, 
sample size needed, sample surveys 
ecological, 154, 182, 453 
error, see variance 
multistage, 229 
nested, 234, 434 
stratified, 283, 315 
studies of statistical problems, 106, 
175, 264, 290, 511 
surveys, 331 
time series, 231 
Scales, 87, 381, 398, 416, and see scores 
Scores, 127, 394, and see discriminant 
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function, organolepsis, ranks, scales 
matrix of, 400 
tournament, see Kendall matrix 
Screening tests, 311, and see bioassay 
Selection, see choice of transformation, 
design of experiments, randomiza- 
tion, rejection of data, sampling 
natural, 216 
of dosages, 312, 323, 491 
of estimate, 222 
of experimental units, 333 
of interviewers, 464 
of judges, 393 
of levels, 312, 323 
of method of statistical analysis, 67, 
387 
of order of treatments, 509 
of variates, 68 
Sensitivity data, see quantal response 
Sensory tests, see organolepsis 
Sequential experiments, 229, 283 
Smoothing, 161, and see moving averages 
Soil heterogeneity, 228, 362 
Split plots, see strip plots 
covariance adjustments, 23 
tests of significance, 23 
Standard deviation, see variance 
Standard error, see variance 
Standard treatment, 491 
Statistical control, see analysis of co- 
variance 
Statistics texts and periodicals, 449 
Steepest ascent (descent), method of, 21 
Stirling’s approximation, 170 
Stochastic processes, 57 
Stratification, see sampling 
Strip plots, 347, and see split plots 
Structural analysis, 445 
Structure, 31, 226, 230 
Subjective evaluation, see judging, or- 
ganolepsis 
Successive approximation, see iteration 
Sufficient statistics, 213, and see effi- 
ciency, estimation 
Summary tables, 355 
Surveys, 331, and see sample surveys 
Survival curve, see dose response curve, 
time response curve 
Survival time, see time response curve 
Systematic designs, 228 
Tables, miscellaneous, 54, 279, 449 
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graphical, 168, 172, 220 
Taste tests, see organolepsis 
Teaching of statistics, 93 
methods, 95 
Tests, 449, and see analysis of variance, 
chi square, confidence limits, F test, 
least significant difference, null hy- 
pothesis, ranks, rejection of data 
a posteriori, 230 
approximate, 224 
biased, 224 
correlated, 29, 69 
exact, 224, 264 
likelihood ratio, 264 
logic of, 31 
multiple comparisons, 230 
multiple range, 307, and see multiple 
F test 
Duncan’s, 307 
normal deviate, 272 
of significance, 354 
of significance of 
difference between adjusted 
means, 23 
difference between means, 230 
difference between proportions, 
283 
results suggested by data, 358 
variance, 264 
“‘zero”’ class, 268, 282 
studentized, 230 
using range, 230 
Theory, see biometry, hypothesis, model 
Threshold, 491 
Time response curve, 73, and see dose 
response curve 
Time series, 334 
Tolerance, 72 
Toxicology, 227, 233, 515, and see bio- 
assay, pharmacology 
Transformations, 85, 386, and see addi- 
tivity, analysis of variance, bioas- 
say, canonical analysis, logistic 
curve, matrices, models 
angular, 513 
choice of, 84, 513 
effect of, 123 
logit, 74, 513 
normal deviate, 513 
probit, 74, 513 
squared hyperbolic secant, 399 
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square root, 451 


Tree crops, 334, 345, and see long-term 


experiments 
Trend, 334, 362, and see regression 
Trigamma function, 170 
Twins, 85 
Uniformity data, 481 
Variance, see covariance, F test, genetics, 
mean deviation 
analysis of, 9, 41, 68, 84, 100, 142, 250, 
345, 369, 428, 447, 450, 481, 507, 
and see additivity, analysis of co- 
variance, chi square, combination of 
data, components of variance, de- 
grees of freedom, disproportionate 
subclasses, errors, fitting constants, 
F tests, least significant difference, 
least squares, long-term  experi- 
ments, missing values, models, mul- 
tiple F tests, multivariate analysis, 
orthogonal polynomials, path coeffi- 
cients, regression, structural analy- 
sis, tests, transformations, uniform- 
ity data 
computation of, 110, 345, 390, 498 
of percentages, 85 
population finite, 229 
relation to design, 365 
summary tables, 355 
approximate, 187 
asymptotic, 174, 215 
average, 34, 358, 372, 407 
binomial portion, 84 
components, 99, 143, 229, 358, 405, 
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423, 505, and see components of 
covariance, structural analysis 
computation of, 339 
confidence limits for, 450 
interpretation of, 505 
variance of, 433 
conditional, 273 
error, 68, 445, 502 
exact, 359 
expectation, 99 
homogeneity of, 83, 265, 279, 391, 398 
negative, 445 
of adiusted mean, 251, 366, 372 
of difference of adjusted means, aver- 
age, 34 
of estimate, 218 
of frequency, 273 
of mean, 143 
ratio, confidence limits for, 99 
related to mean, 82, 85 
sampling, 169, 180 
Variate, fixed, 31 
Variate, random, 31 
Vitamins, 523 
ascorbic acid, 499 
By, 501 
Weighting, 25, 123, 325, 371, 434, 513 
Working angle, 514 
maximal, 516 
Working probit, 513 
Youden squares, 451 
Zero, see differences of zero, tests of 
significance of ‘“‘zero”’ class 
Z test, see F test 
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NOTICES OF MEETINGS 
Notices are invited of forthcoming meetings or sessions of interest to readers. 


Date, 1957 MEBTING LocaTIon 
Feb. 4-8 American Society for Testing Benjamin Franklin 
Materials, Committee Week Hotel, Philadelphia, 
Pa., U.S.A. 


March 7-9 Institute of Mathematical Sta- | Catholic University, 
tistics (Eastern Regional Meet- Washington, D.C., 


ing) and Biometric Society U.S.A. 
(Eastern North American Region) 

May 12-16 _Institute of Food Technologists, Pittsburgh, Pa., 
17th Annual Meeting U.S.A. 

May Eleventh National Meeting, Philadelphia area, 
Operations Research Society of U.S.A. 
America 


July 24-29 International Union of Nutritional Paris, France 
Sciences, Fourth International Con- 
gress of Nutrition 


August 8-15 30th Session, International Sta- Stockholm, Sweden 
tistical Institute, and Congress of 
the International Union for the 


Scientific Study of Porulation 
August 25-29 American Institute of Biological Stanford University, 
Sciences Stanford, Cal., 
U.S.A. 
Sept. 2-6 International Conference on Oxford, England 
Operational Research 


Sept. 10-13 American Statistical Association, Atlantic City, 
Institute of Mathematical Sta- N.J., U.S.A. 
tistics (Annual Meeting), 

ENAR Biometric Society 


Nov. 18- Pacific Science Association, Bangkok, Thailand 
Ninth Congress 
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BACK ISSUES 


Back issues of Biometrics are available at the following postage-paid @ : 
prices in U.S.A. currency: 


Price per Price per ._ 
Volume Number Single Number Volume(unbound) 


1to6 
1 to 6 
lto4 
lto4 
lto4 
lto4 
lto4 
lto4 
lto4 
lto4 
lto4 
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Inquiries, non-member subscriptions and orders for back issues should be 7 
addressed to: 
BIoMETRICS, 
National ResearcH CounciL, 
Orrawa 2, CANADA. 


4 
| 
Year 
1945 $ 
1946 
1947 
1948 
1949 
1950 
1951 
1952 
1953 
1954 
1955 q 
1956 


| 
aa 
| 


